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The distribution of tropical trees across a landscape depends not only on how far seeds fall from a tree but 
also on competition between trees of the same species. 
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EDITORIAL 


Turning the tide on obesity? 


enetics, ultraprocessed foods, portion distortion, 
sweetened beverages, screen time, food addic- 
tion, intestinal microbiota, diet culture, weight 
stigma, food insecurity—all have been implicated 
in the “obesity epidemic.” More than a billion 
people worldwide have obesity, and many more 
are overweight. With the emergence of new, high- 
ly effective weight-loss drugs, might the “fat decades” be- 
come a closed chapter in the history of public health? 
Historically misattributed to a lack of willpower and 
the target of quackery, obesity was gradually reconcep- 
tualized by medicine as a disease, igniting the search for 
effective treatments. Until now, medicine has had little to 
offer and in fact has done harm. From dinitrophenol (cat- 
aracts and fatal hyperthermia) to fenfluramine-phenter- 
mine (valvular heart disease) to orlistat 
(fecal leakage), promising weight- 
loss medications were withdrawn or 
avoided because of adverse effects. 
Bariatric surgery became routine but 
is neither benign nor universally effec- 
tive. Weight cycling taxed bodies and 
minds, with emergent eating disorders 
attributed to repeated episodes of ca- 
loric restriction and appetite rebound. 
Physicians implored patients to lose 
weight but offered few effective tools. 
Meanwhile, the prevalence of 
obesity steadily advanced—dispro- 
portionately affecting marginalized 
racial, ethnic, and socioeconomic communities. In 
2023, the World Obesity Foundation predicted that the 
global economic impact of overweight and obesity will 
reach $4.32 trillion by 2035—if current trends continue. 
Personal cost is also enormous. Unlike many chronic 
diseases, obesity is visible. People try everything to lose 
weight—diets, weight loss and exercise programs, psy- 
chotherapy, devices, and surgeries. Repeated failures 
are the norm and evoke futility and shame. Advocacy 
and activist organizations abound that help people 
make peace with their large size in a society that stig- 
matizes, discriminates, and assigns personal blame. 
The medicalization of obesity has inspired extensive 
research explicating the complex network of biologi- 
cal systems ensuring that we eat to survive. Neuro- 
scientists mapped hunger and satiety brain circuits 
of genetically defined neurons whose activation can 
promote or inhibit eating in animals. Unraveling the 
biology of glucagon-like peptide 1 (GLP-1) enabled the 
development of GLP-1 receptor agonists, initially to 
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treatments 
exist for 
the first time.” 


treat type 2 diabetes but now revolutionizing treat- 
ment for obesity. By stimulating insulin release to 
normalize blood glucose concentration, slow gastric 
emptying, and promote and prolong satiety signals 
from gut to brain, these drugs bring about impressive 
weight loss. Patients also report a decrease in “food 
noise’—the nagging internal dialogue about food that 
makes many feel powerless to resist the urge to eat. 
Appetite decreases, satiety increases, and kilos seem- 
ingly melt away. Will these blockbuster drugs turn the 
global obesity tide? And if so, for everyone, or just an 
advantaged few? 

The “GLPIs” are expensive, and off-label use for weight 
loss will leave some with type 2 diabetes scrambling for 
their medications. Cost and supply constraints may fur- 
ther amplify health disparities. Those 
with insurance or resources to pay out 
of pocket will be the first to benefit. For 
others, these drugs will be out of reach. 

A boon for pharma but a challenge 
for patients is that the lost weight is 
regained when the medications are 
discontinued, functionally rendering 
them “forever” drugs. And it is un- 
clear whether weight regain and re- 
emerging food noise will outweigh the 
inconvenience of a weekly injectable 
and unpleasant side effects. Develop- 
ment of alternative delivery options 
and medications that preserve lean 
muscle may ensure continued use. 

Although standard safety profiles are favorable, what 
are the risks of taking these drugs for life? Does ha- 
bituation occur? Will they dissuade people from regu- 
lar exercise, which carries distinctive health benefits? 
What are the ramifications of repeated cycles of weight 
loss and regain due to medication discontinuation by 
choice, loss of insurance, or availability? And what is 
the abuse potential for extreme cosmetic purposes, for 
people with eating disorders, or for gaining advantage 
in sports where leanness matters? 

Tackling these issues is essential to prevent unin- 
tended consequences brought on by the meteoric suc- 
cess of these drugs. We are entering an era in which 
effective obesity treatments exist for the first time. 
Prevention efforts that address all of the factors that 
contribute to obesity must be bolstered, not abandoned, 
to ensure that the next generation will not require life- 
long medication to maintain metabolic health. 

—-Cynthia M. Bulik and J. Andrew Hardaway 
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Call for fossil’s return to Brazil 


aleontologists from Brazil and Europe have pub- 
lished an open letter asking the German state of 
Baden-Wurttemberg to return a fossil of the dino- 
saur Irritator challengeri (illustration above) to 
Brazil. The remnant of the spinosaurid’s skull was 
discovered in Brazil and purchased from a fossil 
digger by the State Museum of Natural History Stuttgart 
in 1991; researchers used it to describe the species in a 
1996 paper. But the authors of the open letter question 


&& Maybe there's some young Swifties out there that will | 
some day become seismologists or earth scientists. 99 


Seismologist Mouse Reusch, of the Pacific Northwest Seismic Network, to The New York 
Times, after recordings of seismological activity during a Taylor Swift concert in Seattle went viral. 


trials will enroll 100 to 300 people with 


oe = 


the legality of the export to Germany, pointing to a 1942 
Brazilian law saying any fossils found in the country be- 
long to that nation, and a 1990 decree that forbids the 
permanent export of fossils. The plea for repatriation 
comes weeks after Baden-Wiirttemberg returned an- 
other key fossil, that of the dinosaur Ubirajara jubatus, 
to a museum in Santana do Cariri in Brazil. The German 
state’s collections include hundreds of Brazilian fossils, 
the researchers say, and they hope all will be repatriated. 


normalize sleep patterns, researchers will 


New Long Covid clinical trials 


MEDICINE | The National Institutes of 
Health (NIH) this week announced clini- 
cal trials to test treatments against Long 
Covid, including drugs and other strate- 
gies targeting symptoms such as brain fog, 
rapid heart rate, and sleep troubles. Most 
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Long Covid and begin this year. Neurologic 
interventions will include a web-based 
brain training program and a device for 
transcranial stimulation. Scientists will also 
test intravenous immune globulin to treat 
immune abnormalities, as well as the drug 
ivabradine, which can lower heart rate. To 


test melatonin, light therapy, and some 
medications. The new trials are supported 
by RECOVER, NIH’s research effort into 
Long Covid, which launched in 2021. Its 
first trial, a study of the antiviral Paxlovid, 
includes a group that will take the drug for 
25 days and is currently enrolling patients. 
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Retired satellite guided home 


SPACE | The Aeolus spacecraft safely 
plunged toward Earth last week in the first 
ever guided re-entry of a retired satellite 
not engineered to end that way. Modern 
satellites must be designed for safe disposal, 
but Aeolus, a European Space Agency (ESA) 
mission to study wind by laser ranging, was 
built in the early 2000s before those regula- 
tions were in place. The spacecraft’s orbit 
would have naturally decayed, ending in a 
fiery atmospheric death. But to minimize the 
risk of debris reaching population centers, 
ESA decided to steer the re-entry using 
reserved fuel. Mission control brought the 
satellite down from a 320-kilometer altitude 
to 120 kilometers and, on 28 July, it entered 
the atmosphere on target over Antarctica 

so that any debris would plunge into the 
Atlantic Ocean. ESA hopes to dispose of 
other old satellites similarly. 


Essential medicine list updated 


PUBLIC HEALTH | The World Health 
Organization (WHO) has added treat- 
ments for multiple sclerosis to its list of 
essential medicines—but a new, popular 
class of obesity drugs didn’t make the cut. 
WHO's Model List of Essential Medicines 
is updated every 2 years and outlines 

the basic medications that should be 
consistently available and affordable in 
functioning health care systems. Last week, 
WHO released the latest version, which 
includes 502 treatments, after an expert 
committee reviewed dozens of proposals. 
The list includes multiple sclerosis drugs 
for the first time, as well as combo medica- 
tions for preventing cardiovascular disease 
and monoclonal antibodies for Ebola. But 
WHO denied a proposal to include GLP-1 
receptor agonists, longtime antidiabetic 
compounds now being used in weight loss 
drugs such as Wegovy, because of un- 
certainty about their long-term effects. It 
also rejected several patented cancer drugs, 
in part over concerns about affordability. 


Two-percent bump for NIH budget 


PoLicy | A Senate spending panel last 
week approved a modest budget increase 
of 2% for the National Institutes of Health 
(NIH). The bump to $47.8 billion roughly 
matches President Joe Biden’s request for 
the 2024 fiscal year, and includes increased 
funding for research on mental health, 
Alzheimer’s disease, and cancer. But the 
agency’s final budget remains uncertain: A 
corresponding House of Representatives bill 
would slash NIH’s budget by 6%, ban fund- 
ing for fetal tissue research and lab studies 
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unearthed in the 
ancient village of Ba’ja 
in southern Jordan. 


ARCHAEOLOGY 
Ancient necklace restored 


9000-year-old necklace discovered in a young girl’s grave in southern Jordan 

has been painstakingly restored to its former glory. After excavating a site known 

as Ba’ja in 2018, archaeologists set about putting together the 2500 pieces of 

stone and shell that make up the necklace, which had fallen apart over time. The 

reconstruction, presented this week in PLOS ONE, is an impressive example of 
Neolithic-era jewelry. It shows that the people of Ba’ja were skilled artisans who formed 
complex trade networks, the team writes: Materials in the necklace, such as turquoise, 
amber, and mother-of-pearl, would have been imported from far away. The girl's burial 
with such an elaborate ornament suggests to the researchers that her community 
loved and admired her, and grieved when she died. 


in China, and block grants to the nonprofit 
EcoHealth Alliance that collaborated with 
virologists in Wuhan, China. The House and 
Senate measures still need to be approved 
by their respective chambers, then recon- 
ciled into a final spending bill. 


WHO: Expand tobacco fight 


PUBLIC HEALTH | An estimated 8.7 million 
people still die annually from tobacco- 
related causes, and a new World Health 
Organization (WHO) report argues anti- 
tobacco measures should be more widely 
adopted. For example, just 25% of the 
world’s population lives in countries with 
complete bans on tobacco advertising, 


promotion, and sponsorship (TAPS), accord- 
ing to the report released this week. It finds 
that overall, 71% of the world’s population— 
5.6 billion people—are protected by at least 
one of eight antitobacco measures, which 
include public warnings, raising tobacco 
taxes, and requiring completely smoke-free 
indoor public places. WHO estimates that 
because of the measures, which it launched 
in 2005, some 300 million people who 
otherwise would have become smokers did 
not. Yet only about half of countries have 
smoke-free private workplaces and restau- 
rants, and 53 lack complete smoking bans in 
health care facilities. Richer countries also 
lag behind poorer ones in adopting TAPS 
bans, the report notes. 
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Side effects loom over Alzheimer’s drugs 


Despite landmark antibody approval, research into brain swelling and bleeding lags 


An MRI from a woman who died after receiving lecanemab in a trial shows extreme brain swelling (right), obscuring cerebral folds visible in a pretreatment scan (left). 


By Jennifer Couzin-Frankel 


sea change is underway in the treat- 

ment of Alzheimer’s disease, where for 

the first time a drug that targets the 

disease’s pathology and clearly slows 

cognitive decline has hit the U.S. mar- 

ket. A related therapy will likely be 
approved in the coming months. As many 
neurologists, patients, and brain scientists 
celebrate, they're also nervously eyeing com- 
plications from treatment: brain swelling 
and bleeding, which in clinical trials affected 
up to about one-third of patients and ranged 
from asymptomatic to fatal. 

The side effect—amyloid-related imaging 
abnormalities, or ARIA—remains mysteri- 
ous. “We don’t really understand what it is, 
what causes it, and what we can do about 
it,” says neurologist R. Scott Turner, direc- 
tor of the Memory Disorders Program at 
Georgetown University. 

The drugs triggering the problem are 
monoclonal antibodies that mop up beta 
amyloid, a protein that accumulates in the 
brains of people with Alzheimer’s. One such 
antibody, lecanemab, marketed as Leqembi 
by Eisai and Biogen, won full approval in the 
United States last month after trials showed 
it modestly slowed cognitive decline in early 
disease. Another, Eli Lilly & Co’s donanemab, 
had similar results and is under regulatory 
review. These therapies aim to clear clumps 
of amyloid, called plaques, from between 
neurons. But such plaques can also build up 
in brain blood vessels, and breaking them 
down there may set off ARIA. 

The condition comes in two forms: brain 
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swelling (ARIA-E for edema) and brain bleed- 
ing (ARIA-H for hemorrhage). Both show up 
in MRI and some patients also experience 
symptoms, which can include headaches, 
confusion, vomiting, and seizures. In the larg- 
est lecanemab trial, 21% of those on the drug 
had ARIA, and 3% had ARIA with symptoms. 
In donanemab’s biggest trial, 37% on therapy 
got ARIA and 6% had symptoms. 

These trials and others of similar anti- 
bodies have helped identify risk factors 
for ARIA, in particular having a gene vari- 
ant, APOE4, that also heightens risk of 
Alzheimer’s. Anticoagulants that treat or 
prevent blood clots appear to increase ARIA 
risk as well. 

As patients and families express enthu- 
siasm for the treatments, neurologists are 
worrying not only about serious ARIA, but 
the challenges that may accompany milder 
cases, which often call for pausing treatment 
and monitoring brain health with MRIs. 
“We've got to get our hands around this,’ says 
Walter Koroshetz, director of the National In- 
stitute of Neurological Disorders and Stroke 
(NINDS). Yet few researchers are studying 
ARIA. NINDS is now funding one ARIA- 
focused grant, but has new applications un- 
der review. Along with the National Institute 
on Aging (NIA), NINDS is also planning a 
meeting on ARIA next month to sketch out 
scientific priorities. 

Interest in ARIA blossomed 20 years ago, 
when antiamyloid antibodies were drawing 
increasing attention. In 2002, a brief report 
in Science described 10 mice engineered 
to develop Alzheimer’s-like symptoms and 
treated with antiamyloid therapy; plaques 


faded, but the mice had more frequent and 
severe brain hemorrhages than controls. “I 
call that the first-ever report of ARIA,’ says 
Donna Wilcock, a neuroscientist who directs 
the Center for Neurodegenerative Disorders 
at the Indiana University School of Medi- 
cine. She joined other scientists who began 
to study the complication. But as excitement 
grew about the therapies’ potential and they 
entered clinical trials, the basic science “just 
kind of dried up,’ she says. 

Wilcock has returned to ARIA research and 
holds the lone NINDS ARIA grant. She is ex- 
amining whether side effects in her antibody- 
exposed mice align with what most doctors 
believe occurs in people on treatment: ARIA- 
E, followed in some cases by ARIA-H. 

Wilcock and others think a condition 
called cerebral amyloid angiopathy (CAA), 
a buildup of amyloid in brain blood vessels 
that’s common in older people with Alzheim- 
er’s, helps set the stage for ARIA. CAA can 
lead to small brain hemorrhages; in the drug 
trials, the rate of ARIA-H alone was roughly 
the same in the treated and placebo groups. 
CAA also raises risk of ARIA for those on anti- 
amyloid antibodies, and the U.S. Food and 
Drug Administration recommends that peo- 
ple with signs of CAA exercise caution before 
taking lecanemab. But CAA isn’t always easy 
to detect on MRIs, especially when it’s mild. 

What happens, Wilcock suspects, is that as 
the antibodies break down amyloid depos- 
its in blood vessels, they activate microglia. 
These cells support brain repair and likely 
help antibodies clear plaques, but they can 
also cause inflammation. Such inflammation 
could prompt blood vessels to leak, produc- 
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ing swelling. In the trials, brain swelling, 
especially with symptoms, was far more 
common in the treatment than the placebo 
groups. Blood vessel ruptures can also cause 
bleeding. But, “There’s a lot of what-ifs” in 
this cascade of events, Wilcock acknowl- 
edges, and there are other theories, too. 

She and other scientists suspect an un- 
common complication of CAA, called CAA- 
related inflammation (CAA-ri), may offer 
further insight. Here, people not taking 
the antibodies spontaneously develop what 
looks like ARIA-E with symptoms. Last year, 
Fabrizio Piazza, a translational researcher at 
the University of Milano-Bicocca, reported 
in Neurology that in patients with CAA-ri, 
microglia are activated in areas with brain 
swelling. That finding, he says, supports the 
idea that these cells may contribute to treat- 
ment-related ARIA. 

Piazza also found spontaneously occurring 
antiamyloid antibodies in the cerebrospinal 
fluid (CSF) of people with CAA-ri. He’s setting 
up a global registry of CAA-ri patients to see 
whether these CSF markers can help predict 
and track ARIA-E, given that abnormalities 
on MRIs don’t always re- 
flect symptoms. 

For now, the risk of 
complications raises many 
questions for doctors and 
patients, some of them life 
or death. At Northwestern 
University, chief of neu- 
rocritical care Sherry Chou 
is struggling with what 
she knows is an inevitable 
scenario: A patient on an- 
tibody treatment for early 
Alzheimer’s is having a stroke and urgently 
needs a clot-busting drug to stop brain cells 
dying en masse. Withholding the drug could 
mean devastating brain damage, but giving 
it may raise the risk of catastrophic hemor- 
rhage. In February, Chou and her colleagues 
published the case of awoman she helped care 
for, who was taking Leqembi through a trial, 
received this stroke treatment, and died days 
later after suffering significant hemorrhages 
(Science, 14 April, p. 122). 

Given that about 800,000 people in the 
United States have a stroke every year, Chou 
anticipates “natural experiments” where doc- 
tors make emergency medical decisions with- 
out data. “The research is just getting started, 
but the drug is already in clinical use,’ Chou 
says. “What happens now?” 

Another quandary is when or whether to 
return a patient to antibody treatment af- 
ter stopping it because of ARIA. “There’s a 
lot of guesswork” and a need for more data, 
Turner says. MRIs may be reassuring, but the 
risk of a repeat scenario could remain. In the 
donanemab trial, one participant died from 
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ARIA months after an episode apparently re- 
solved and therapy restarted. 

Such guesswork is complicated by an- 
other mystery: “I want to know what, if 
anything, ARIA did to a patient’s cognition 
at the end of the clinical trial,” especially 
in serious cases, says Madhav Thambisetty, 
chief of the clinical and translational neuro- 
science section at NIA and a neurologist at 
Johns Hopkins University. One concern is 
whether some patients with marked ARIA 
might have worsening memory or other 
cognitive functions by the trial’s end. Data 
on that question exist but the companies 
haven’t shared them, Thambisetty says. 

Doctors also want to know how best to 
treat ARIA and whether it can be prevented. 
Again, CAA-ri may provide clues. Steven 
Greenberg, a vascular neurologist and di- 
rector of the Hemorrhagic Stroke Research 
Program at Massachusetts General Hospi- 
tal, recently found that CAA-ri patients with 
brain swelling reminiscent of ARIA-E gener- 
ally had quick improvement and a reduced 
chance of relapse when treated with high- 
dose steroids, a treatment also used for 
severe ARIA. Depending 
on how steroids work in 
CAA-ri, Greenberg even 
wonders whether they 
might be given alongside 
antiamyloid therapy or at 
the earliest sign of ARIA 
to head off severe effects; 
such strategies would 
need testing, he stresses. 

Researchers are espe- 
cially eager to help people 
with two copies of APOE4. 
They are at higher risk for ARIA but also 
develop Alzheimer’s at a younger age. 
Many doctors are unsure about prescrib- 
ing the antibodies to these patients (lec- 
anemab data suggest they may also benefit 
less than patients without APOE4), and the 
Department of Veterans Affairs has said 
it won’t typically cover antiamyloid treat- 
ment for them, but will consider requests 
for exceptions. But, “Just saying, ‘Oh, 
sorry,” isn’t good enough, Wilcock says. 
“We need to figure this out so we can treat 
those people without the risk.” 

Ultimately, the risk calculus will be per- 
sonal. Nancy Meserve, a retired special 
education administrator who carries two 
copies of APOE4, is enrolled in a trial to see 
whether lecanemab can prevent Alzheim- 
er’s. She doesn’t know whether she’s on 
the drug or the placebo, but based on side 
effects suspects the drug; she also devel- 
oped ARIA-H with mild symptoms, which 
resolved. “People take risks with things that 
are important to them,” Meserve says. “To 
me, this was important.” 
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Ship fuel rules 
have altered 
clouds and 
warmed waters 


An unprecedented test of 
reverse geoengineering 
unfolds above the oceans 


By Paul Voosen 


he Atlantic Ocean is running a fever. 

Waters off Florida have become a hot 

tub, bleaching the third-largest bar- 

rier reef in the world. Off the coast 

of Ireland, extreme heat was impli- 

cated in the mass death of seabirds. 
For years, the north Atlantic was warming 
more slowly than other parts of the world. 
But now it has caught up, and then some. 
Last month, the sea surface there surged to 
a record 25°C—nearly 1°C warmer than the 
previous high, set in 2020—and tempera- 
tures haven’t even peaked yet. “This year 
it’s been crazy,’ says Tianle Yuan, an atmo- 
spheric physicist at NASA’s Goddard Space 
Flight Center. 

The obvious and primary driver of this 
trend is society’s emissions of greenhouse 
gases, which trap heat that the oceans 
steadily absorb. Another influence has 
been recent weather, especially stalled 
high-pressure systems that suppress cloud 
formation and allow the oceans to bake in 
the Sun. 

But researchers are now waking up to an- 
other factor, one that could be filed under 
the category of unintended consequences: 
disappearing clouds known as ship tracks. 
Regulations imposed in 2020 by the United 
Nations’s International Maritime Organiza- 
tion (IMO) have cut ships’ sulfur pollution 
by more than 80% and improved air quality 
worldwide. The reduction has also lessened 
the effect of sulfate particles in seeding and 
brightening the distinctive low-lying, reflec- 
tive clouds that follow in the wake of ships 
and help cool the planet. The 2020 IMO rule 
“Is a big natural experiment,” says Duncan 
Watson-Parris, an atmospheric physicist at 
the Scripps Institution of Oceanography. 
“We're changing the clouds.” 

By dramatically reducing the number 
of ship tracks, the planet has warmed up 
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ing swelling. In the trials, brain swelling, 
especially with symptoms, was far more 
common in the treatment than the placebo 
groups. Blood vessel ruptures can also cause 
bleeding. But, “There’s a lot of what-ifs” in 
this cascade of events, Wilcock acknowl- 
edges, and there are other theories, too. 

She and other scientists suspect an un- 
common complication of CAA, called CAA- 
related inflammation (CAA-ri), may offer 
further insight. Here, people not taking 
the antibodies spontaneously develop what 
looks like ARIA-E with symptoms. Last year, 
Fabrizio Piazza, a translational researcher at 
the University of Milano-Bicocca, reported 
in Neurology that in patients with CAA-ri, 
microglia are activated in areas with brain 
swelling. That finding, he says, supports the 
idea that these cells may contribute to treat- 
ment-related ARIA. 

Piazza also found spontaneously occurring 
antiamyloid antibodies in the cerebrospinal 
fluid (CSF) of people with CAA-ri. He’s setting 
up a global registry of CAA-ri patients to see 
whether these CSF markers can help predict 
and track ARIA-E, given that abnormalities 
on MRIs don’t always re- 
flect symptoms. 

For now, the risk of 
complications raises many 
questions for doctors and 
patients, some of them life 
or death. At Northwestern 
University, chief of neu- 
rocritical care Sherry Chou 
is struggling with what 
she knows is an inevitable 
scenario: A patient on an- 
tibody treatment for early 
Alzheimer’s is having a stroke and urgently 
needs a clot-busting drug to stop brain cells 
dying en masse. Withholding the drug could 
mean devastating brain damage, but giving 
it may raise the risk of catastrophic hemor- 
rhage. In February, Chou and her colleagues 
published the case of awoman she helped care 
for, who was taking Leqembi through a trial, 
received this stroke treatment, and died days 
later after suffering significant hemorrhages 
(Science, 14 April, p. 122). 

Given that about 800,000 people in the 
United States have a stroke every year, Chou 
anticipates “natural experiments” where doc- 
tors make emergency medical decisions with- 
out data. “The research is just getting started, 
but the drug is already in clinical use,’ Chou 
says. “What happens now?” 

Another quandary is when or whether to 
return a patient to antibody treatment af- 
ter stopping it because of ARIA. “There’s a 
lot of guesswork” and a need for more data, 
Turner says. MRIs may be reassuring, but the 
risk of a repeat scenario could remain. In the 
donanemab trial, one participant died from 
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“The research 
is just getting started, 
but the drug is already 
in clinical use. ... 


What happens now?” 


Sherry Chou, 
Northwestern University 


ARIA months after an episode apparently re- 
solved and therapy restarted. 

Such guesswork is complicated by an- 
other mystery: “I want to know what, if 
anything, ARIA did to a patient’s cognition 
at the end of the clinical trial,” especially 
in serious cases, says Madhav Thambisetty, 
chief of the clinical and translational neuro- 
science section at NIA and a neurologist at 
Johns Hopkins University. One concern is 
whether some patients with marked ARIA 
might have worsening memory or other 
cognitive functions by the trial’s end. Data 
on that question exist but the companies 
haven’t shared them, Thambisetty says. 

Doctors also want to know how best to 
treat ARIA and whether it can be prevented. 
Again, CAA-ri may provide clues. Steven 
Greenberg, a vascular neurologist and di- 
rector of the Hemorrhagic Stroke Research 
Program at Massachusetts General Hospi- 
tal, recently found that CAA-ri patients with 
brain swelling reminiscent of ARIA-E gener- 
ally had quick improvement and a reduced 
chance of relapse when treated with high- 
dose steroids, a treatment also used for 
severe ARIA. Depending 
on how steroids work in 
CAA-ri, Greenberg even 
wonders whether they 
might be given alongside 
antiamyloid therapy or at 
the earliest sign of ARIA 
to head off severe effects; 
such strategies would 
need testing, he stresses. 

Researchers are espe- 
cially eager to help people 
with two copies of APOE4. 
They are at higher risk for ARIA but also 
develop Alzheimer’s at a younger age. 
Many doctors are unsure about prescrib- 
ing the antibodies to these patients (lec- 
anemab data suggest they may also benefit 
less than patients without APOE4), and the 
Department of Veterans Affairs has said 
it won’t typically cover antiamyloid treat- 
ment for them, but will consider requests 
for exceptions. But, “Just saying, ‘Oh, 
sorry,” isn’t good enough, Wilcock says. 
“We need to figure this out so we can treat 
those people without the risk.” 

Ultimately, the risk calculus will be per- 
sonal. Nancy Meserve, a retired special 
education administrator who carries two 
copies of APOE4, is enrolled in a trial to see 
whether lecanemab can prevent Alzheim- 
er’s. She doesn’t know whether she’s on 
the drug or the placebo, but based on side 
effects suspects the drug; she also devel- 
oped ARIA-H with mild symptoms, which 
resolved. “People take risks with things that 
are important to them,” Meserve says. “To 
me, this was important.” 


“® 


ATMOSPHERIC SCIENCE | Check for 


updates 


Ship fuel rules 
have altered 
clouds and 
warmed waters 


An unprecedented test of 
reverse geoengineering 
unfolds above the oceans 


By Paul Voosen 


he Atlantic Ocean is running a fever. 

Waters off Florida have become a hot 

tub, bleaching the third-largest bar- 

rier reef in the world. Off the coast 

of Ireland, extreme heat was impli- 

cated in the mass death of seabirds. 
For years, the north Atlantic was warming 
more slowly than other parts of the world. 
But now it has caught up, and then some. 
Last month, the sea surface there surged to 
a record 25°C—nearly 1°C warmer than the 
previous high, set in 2020—and tempera- 
tures haven’t even peaked yet. “This year 
it’s been crazy,’ says Tianle Yuan, an atmo- 
spheric physicist at NASA’s Goddard Space 
Flight Center. 

The obvious and primary driver of this 
trend is society’s emissions of greenhouse 
gases, which trap heat that the oceans 
steadily absorb. Another influence has 
been recent weather, especially stalled 
high-pressure systems that suppress cloud 
formation and allow the oceans to bake in 
the Sun. 

But researchers are now waking up to an- 
other factor, one that could be filed under 
the category of unintended consequences: 
disappearing clouds known as ship tracks. 
Regulations imposed in 2020 by the United 
Nations’s International Maritime Organiza- 
tion (IMO) have cut ships’ sulfur pollution 
by more than 80% and improved air quality 
worldwide. The reduction has also lessened 
the effect of sulfate particles in seeding and 
brightening the distinctive low-lying, reflec- 
tive clouds that follow in the wake of ships 
and help cool the planet. The 2020 IMO rule 
“Is a big natural experiment,” says Duncan 
Watson-Parris, an atmospheric physicist at 
the Scripps Institution of Oceanography. 
“We're changing the clouds.” 

By dramatically reducing the number 
of ship tracks, the planet has warmed up 
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Crisscrossing clouds known as ship tracks can be seen off the coast of Spain in this 2003 satellite image. With 


az 


the phasing out of high-sulfur ship fuel, these reflective clouds have become scarcer, leading to ocean warming. 


faster, several new studies have found. That 
trend is magnified in the Atlantic, where 
maritime traffic is particularly dense. In the 
shipping corridors, the increased light rep- 
resents a 50% boost to the warming effect 
of human carbon emissions. It’s as if the 
world suddenly lost the cooling effect from 
a fairly large volcanic eruption each year, 
says Michael Diamond, an atmospheric sci- 
entist at Florida State University. 

The natural experiment created by the 
IMO rules is providing a rare opportu- 
nity for climate scientists to study a geo- 
engineering scheme in action—although 
it is one that is working in the wrong di- 
rection. Indeed, one such strategy to slow 
global warming, called marine cloud bright- 
ening, would see ships inject salt particles 
back into the air, to make clouds more re- 
flective. In Diamond’s view, the dramatic 
decline in ship tracks is clear evidence that 
humanity could cool off the planet signifi- 
cantly by brightening the clouds. “It sug- 
gests pretty strongly that if you wanted to 
do it on purpose, you could,” he says. 

The influence of pollution on clouds 
remains one of the largest sources of un- 
certainty in how quickly the world will 
warm up, says Franziska Glassmeier, an at- 
mospheric scientist at the Delft University 
of Technology. Progress on understanding 
these complex interactions has been slow. 
“Clouds are so variable,” Glassmeier says. 

Some of the basic science is fairly well 
understood. Sulfate or salt particles seed 
clouds by creating nuclei for vapor to con- 
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dense into droplets. The seeds also brighten 
existing clouds by creating smaller, more 
numerous droplets. The changes don’t stop 
there, says Robert Wood, an atmospheric 
scientist at the University of Washing- 
ton. He notes that smaller droplets are 
less likely to merge with others, poten- 
tially suppressing rainfall. That would in- 
crease the size of clouds and add to their 
brightening effect. But modeling also sug- 
gests that bigger clouds are more likely 
to mix with dry air, which would reduce 
their reflectivity. 

Even before the IMO regulations, ship 
tracks have been a target for researchers 
to test these ideas. Given their striking ap- 
pearance, these linear clouds were a natural 
candidate for artificial intelligence-based 
image recognition, Yuan says. Using such 
techniques, and 2 decades of calibrated im- 
agery from NASA’ ailing Terra and Aqua 
satellites, Yuan and co-authors discovered 
10 times more ship tracks than previously 
identified using manual techniques. In 
their study, published last year in Science 
Advances, they also found these tracks de- 
creased by more than 50% in the main ship- 
ping corridors after the IMO regulations. 

In more recent work, they take this 
analysis a step further, calculating the 
amount of cooling associated with the 
tracks’ brightening effect and the way 
the pollution extended the lifetime of the 
clouds. IMO rules have warmed the planet 
by 0.1 watts per square meter—double the 
warming caused by changes to clouds by 


airplanes, they conclude in a paper under 
review. The impact is magnified in regions 
of heavy shipping, like the north Atlantic, 
where the disappearing clouds are “shock 
to the system,” Yuan says. The increase in 
light, which was worsened by a lack of re- 
flective Saharan dust over the ocean this 
year, “can account for most of the warm- 
ing observed” in the Atlantic this summer, 
he says. 

Instead of focusing on visible tracks, 
Watson-Parris and his colleagues started 
with ship location data, combining those 
coordinates with weather records to proj- 
ect where the ships’ pollution traveled. 
They compared clouds at these locations 
with nearby clouds free of any ship pollu- 
tion. In Nature last year, they reported that 
these “invisible” ship tracks not only en- 
hanced low lying marine clouds, as usual, 
but also markedly increased the volume of 
puffy cumulus clouds higher in the atmo- 
sphere, previously thought to be immune 
to the influence of ship pollution. They 
concluded that air pollution could be caus- 
ing clouds to cool the climate at roughly 
double the previously projected strength. 

However, when the team then looked 
at the effect of the IMO rules on these in- 
visible tracks, they received a shock: The 
decline in pollution didn’t make the cumu- 
lus clouds any less puffy, they report in a 
new preprint in Atmospheric Chemistry 
and Physics (ACP). It suggests these clouds 
have a saturation point, after which added 
pollution does little to increase their depth, 
Watson-Parris says. “We removed 80% of 
the aerosols, but that’s still not taking us 
close to the preindustrial state.” 

A third way to explore the impact of ship 
pollution on clouds is not to study them 
in aggregate, but rather to zoom in on 
ocean stretches where winds flow in paral- 
lel with shipping lanes, keeping the pollu- 
tion tightly corralled. Such a stretch exists 
in the southeast Atlantic, off the coast of 
Angola. Observing this region with the 
Terra satellite, Diamond found that, with 
lower pollution, the cloud droplet sizes 
had grown to the largest size, by far, in the 
past 2 decades. Extrapolating from there, 
Diamond estimates in an paper last week 
in ACP that the IMO rules have caused 
warming globally at levels like those seen 
by Yuan. 

Later this year, Diamond, Yuan, and oth- 
ers will begin to compare their techniques 
for studying the interaction of pollution 
and clouds, under the auspices of the Na- 
tional Oceanic and Atmospheric Admin- 
istration’s small geoengineering research 
program. After a few more years, Wood 
says, “We’re really going to have something 
to say about these cloud adjustments.” 
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Study probes link between heat and miscarriage 


Temperature sensors are tracking exposure in people trying to conceive 


By Meredith Wadman 


s Marissa Friedman walked her dog 

early on the morning of 31 July, the 

temperature in Roswell, Georgia, was 

on its way to a high of nearly 35°C 

and it was already humid. Tagging 

along with her was a temperature and 
humidity monitor on a lanyard around her 
neck, silently recording these details. 

Friedman, 31, is one of more than 
100 women who have so far enrolled in a 
new study examining whether heat exposure 
might cause miscarriages. For Friedman, who 
has been trying to get pregnant, the question 
“Gs definitely a source of morbid curiosity.” 

It’s also an urgent one, as climate change 
promises more record-breaking tempera- 
tures. The study, launched in May and run 
by Amelia Wesselink, a reproductive and 
environmental epidemiologist at Boston 
University, aims to pin down the relation- 
ship between heat and pregnancy loss, and 
to determine whether there are certain 
weeks during development when the em- 
bryo is at greatest risk. The results might 
guide behavior changes, clinical reeommen- 
dations, and public policies to protect preg- 
nant people from heat. 

Wesselink and her colleagues hope to re- 
cruit 400 female participants in the United 
States and Canada who are trying to get preg- 
nant and are new enrollees in a large, long- 
standing online study called PRESTO. That 
study examines many factors that could in- 
fluence fertility and whether a pregnancy is 
successfully carried to term. The participants 
will self-report if and when they get pregnant 
and have a baby—or miscarry. 

Wesselink’s team recently analyzed 
PRESTO data from 2013 to 2020 and found 
a peak in pregnancy loss in late summer. The 
risk of miscarrying was 30% higher in late 
August than in late February, they reported 
in a 2022 paper in Epidemiology. It was 40% 
higher when counting only pregnancy losses 
before 8 weeks of gestation—the period when 
most losses occur. 

“That got us thinking: Could heat be play- 
ing a role here?” Wesselink recalls. “We dug 
into the literature and there just really hasn’t 
been a good prospective study.’ So this sum- 
mer and next, study participants will wear 
their temperature and humidity monitors 
continuously for 1 month. Humidity af- 
fects the experience of heat, Wesselink says; 
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higher humidity might trigger miscarriage at 
lower temperatures. 

Previous studies of heat and miscarriage 
have looked backward in time, relying on air 
temperature measured at the weather sta- 
tion nearest to a participant’s home. But the 
personal monitors will let researchers follow 
the women wherever they go, indoors and 
out. Sherri Davis, a 34-year-old study par- 
ticipant in suburban Detroit, has clipped her 
sensor to her purse, set it on her desk at the 
office, and kept it on a nearby patio table as 


Marissa Friedman wears a heat sensor on a recent walk. 


she chased her nearly 3-year-old daughter 
around the backyard. Miscarriage is com- 
mon in her family, and she has been trying 
to conceive a second child. “Anything I feel 
like I can do to help research progress faster, 
I want to do,” she says. 

Wesselink’s team will pair sensor data 
with results from an extensive questionnaire 
the participants answer on factors such as 
neighborhood green space, how much time 
they spend outside, and whether they have 
air conditioning. The combined data will 
inform a predictive model of personal heat 
exposure that can then be used in the full, 
17,000-plus group of PRESTO participants 
to examine the link between miscarriage 
and heat with much more power. 

“This is an impressive research design,’ 
says Tamas Hajdu, an economist at the Cen- 
tre for Economic and Regional Studies. His 
own 2021 analysis of decades of pregnancy 


and weather records from Hungary esti- 
mated that, on a day with an average tem- 
perature of 25°C, one more woman in every 
half-million lost a pregnancy during the first 
6 weeks of gestation, compared with days 
with an average temperature of 15°C to 20°C. 
The personal sensors in the new study offer 
“a more accurate assessment of the effect of 
heat,” he says. 

Having granular personal data from sen- 
sors, adds Niamh Forde, a reproductive bio- 
logist at the University of Leeds, “could be 
really powerful in helping us understand: 
When [during a pregnancy] is the heat stress 
most critical? When do we want to try and 
avoid it?” 

Animal research supports a link between 
heat and miscarriage—particularly studies in 
cattle, whose reproductive physiology closely 
mimics that of humans. One study of cows 
in Spain found that the rate of miscarriage 
during the second and third months of gesta- 
tion was 2% from October to April versus 12% 
during the much warmer period from May 
to September. 

Several factors may be at play in cattle, 
and people, says Peter Hansen, a reproduc- 
tive biologist at the University of Florida. 
Heat stress may shunt blood away from the 
uterus to the skin, compromising the pla- 
centa’s nourishing and protective functions. 
Heat could also directly damage either the 
placenta or the embryo. And high tempera- 
tures during gestation are associated with 
changes to the fetal epigenome—chemicals 
that influence which genes are off or on. 
Such changes could conceivably “doom” 
some embryos before birth, Hansen says. 

Probing such mechanisms in people is 
ethically and logistically difficult, Wesselink 
notes. And because her group is not collect- 
ing biological samples, her project won't 
be able account for the possible role of a 
major cause of pregnancy loss: abnormal 
chromosome counts in the embryo, which, 
theoretically, could occur more often with 
rising temperatures. 

Audrey Gaskins, a reproductive epidemio- 
logist at Emory University, worries that with 
a small group of participants wearing sensors 
for just 1 month, the study may not pick up 
many pregnancy losses in particularly risky 
periods, such as the 2 weeks after conception. 
Still, “Itll definitely add information,’ she 
says. “I will be waiting on pins and needles 
to see what they find.” & 
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Move to list monkey as endangered spurs dispute 


U.S. group challenges decision to reclassify macaque widely used in biomedical research 


By Dennis Normile 


U.S. science advocacy group is weigh- 

ing in on a conservation debate about 

monkeys in Southeast Asia. The Na- 

tional Association for Biomedical 

Research (NABR) has asked the In- 

ternational Union for Conservation of 
Nature (IUCN) to reverse its 2022 decision to 
shift the long-tailed macaque (Macaca fascic- 
ularis), an animal widely used in biomedical 
research, from vulnerable to endangered on 
its Red List of Threatened Species. 

The IUCN decision was based on an as- 
sessment that “contains numerous errors 
and misstatements, and does not provide 
actual evidence of species declines,’ NABR 
President Matthew Bailey argued in a 15 June 


“hyperabundant’—in some regions. In some 
places they are considered pests and culled. 
IUCN, an independent organization, does 
not directly regulate the global wildlife trade. 
But its Red List carries weight with national 
and international bodies that do set import 
and export rules, including the Convention 
on International Trade in Endangered Spe- 
cies of Wild Fauna and Flora (CITES). The 
184 nations that have joined CITES, for ex- 
ample, require buyers and sellers to obtain 
permits to trade in long-tailed macaques. 
The US. generally allows macaques to be 
imported for use in biomedical research only 
if they have been bred in captivity. This is to 
protect wild populations and to ensure lab 
animals are pathogen-free. Last year, how- 
ever, the U.S. government charged several 


Researchers disagree on whether the long-tailed macaque, a mainstay of laboratory research, is endangered. 


statement. NABR worries the change could 
lead to restrictions on importing macaques 
for research. The species, also known as 
the crab-eating macaque and cynomolgus 
monkey, accounted for more than 95% of 
the roughly 33,000 nonhuman primates im- 
ported into the United States in 2022. 
NABR’s request has triggered IUCN’s re- 
view process and highlighted a debate over 
the macaque’s conservation status. Some re- 
searchers welcomed the endangered listing, 
given rising threats from hunters, habitat de- 
struction, and poachers capturing the mon- 
keys for trade. But it surprised others. “IUCN 
is off base here,” says ecologist William Laur- 
ance of James Cook University. A recent study 
concludes macaques are abundant—even 
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monkey suppliers with falsely labeling ma- 
caques caught in the wild as captive-bred 
(Science, 2 December 2022, p. 934). 

Such poaching is one reason IUCN listed 
the macaque as endangered. Its assessment, 
by anthropologist Malene Hansen of Prince- 
ton University and 17 co-authors, cited a 2006 
study that estimated the global population de- 
clined from 5 million in the 1980s to 3 million 
in the 2000s. Since then threats have grown 
and, over the next 3 macaque generations, 
“we suspect the species will experience at 
least a 50% decline,” the team wrote. “[It] 
may be facing irreversible population loss.” 

That conclusion sounds right to primate 
ecologist Kurnia Ilham of Andalas University, 
who studies macaque troops on Sumatra in 


Indonesia. He believes populations will de- 
cline “if the situation is not improved.” 

But wildlife managers in some areas say 
they have become pests. In Malaysia alone of- 
ficials killed more than 97,000 macaques in 
2022, according to reports, spurred by com- 
plaints about the animals biting people and 
raiding garbage dumps and crops. 

A recent study supports the idea that ma- 
caques are thriving in some areas. The paper, 
published on 13 June in Biological Reviews, 
synthesized studies from 58 landscapes 
across Asia and new camera trap data from 
20 sites in Thailand, Malaysia, Singapore, Su- 
matra, and Borneo. It found that long-tailed 
macaques are rare in intact forests, but can 
be “hyperabundant” where forest has been 
converted to croplands, including oil palm 
plantations. Those disturbed landscapes can 
feature habitat similar to the coastal and riv- 
erside vegetation the animals have long oc- 
cupied. The data “clearly show” the macaque 
is not yet globally endangered, says co-author 
Matthew Luskin, an ecologist at the Univer- 
sity of Queensland. 

Laurence says IUCN is “being alarmist and 
not basing its decisions on good data.” Other 
researchers point to the assessment’s char- 
acterization of a 2021 study by conservation 
scientist Matthew Nuttall of the University of 
Stirling and colleagues. It tracked population 
trends of long-tailed macaques in a Cambo- 
dian reserve. The assessors note that study 
found numbers had declined by 50% over a 
decade. But the paper “actually states that 
the species is stable,” says conservation bio- 
logist Alice Hughes of the University of Hong 
Kong, who is not involved in the dispute. 

Nuttall says the study did generate the two 
data points used by IUCN assessors. But con- 
sidering other observations, he says the study 
“says the species are stable,” while suggesting 
“a shallow decline.” (Hansen notes all but one 
observed population declined by 50%.) 

Luskin contends the macaque is vulner- 
able but not endangered. Whether IUCN 
will agree is uncertain. One complexity, says 
ecologist Jedediah Brodie of the University 
of Montana, who is not involved in the dis- 
pute, is that “there could be a global decline 
in the species at the same time as popula- 
tion increases in certain edge habitats.” 

IUCN has asked NABR and the assessors 
to negotiate a resolution. If that fails, IUCN 
could ask outside experts to help reach a de- 
cision, which could take a year or more. & 
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India moves to adopt bill that 
would weaken forest protections 


Government says measure is needed to fight climate 
change, but conservationists have opposed it 


By Tanvi Dutta Gupta 


onservation scientists fear more than 
one-quarter of forests in India could 
lose legal protection under contro- 
versial legislation that the nation’s 
Parliament could approve as early as 
this week. 

The legislation amends India’s flagship 
1980 Forest Conservation Act. The govern- 
ment of Prime Minister Narendra Modi says 
it will help India meet its commitments to 
combat climate change by planting trees, 
and “eliminate ambiguities” in rules that 
govern how officials legally define forests 
and regulate their use. 

But researchers and others worry the 
measure—which has triggered nationwide 
protests—will irrevocably transform In- 
dia’s landscape. They say the amendments 
will open forests to development, harm 
biodiversity, and weaken the rights of In- 
digenous people. 

The legislation “is ecocide,’ says conser- 
vation biologist Ravi Chellam, CEO of the 
Metastring Foundation, which makes policy 
relevant data publicly available. “People are 
gobsmacked by the brazenness of it all.” 

Since Modi’s government first floated the 
legislation in 2021, it has attracted wide- 
spread criticism. Conservationists oppose 
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provisions that would remove protection 
from vast swaths of forest that have not 
been officially recognized in government 
documents. They say the proposal would 
also make it easier to mine in protected 
areas, and build infrastructure related to 
ecotourism, including zoos and resorts. 

Human rights activists have decried 
language that would reduce the need for 
developers to consult with or gain prior 
consent from forest-dependent communi- 
ties, including Indigenous groups. Advo- 
cates also raised alarm about provisions 
allowing the government to waive reviews 
of projects that are within 100 kilometers 
of India’s border and deemed critical to na- 
tional security. In some border states with 
high biodiversity, that exception would 
cover nearly all forested land. 

“One could argue this is not just an 
Amendment but an entirely new Act,” more 
than 400 ecologists wrote last month to 
India’s environment minister after the gov- 
ernment moved to present an unchanged 
version of the bill to Parliament. They asked 
the minister to delay any vote pending con- 
sultation with experts. 

On 26 July, however, Parliament’s lower 
house took less than 20 minutes to pass 
the bill after almost no debate. As Science 
went to press, Parliament’s upper house 


law would reduce protection of the nation’s forests. 


was expected to follow suit. 

The legislative rush has left many conser- 
vationists demoralized. “Things are already 
very bad with Indian forests,” says Ghazala 
Shahabuddin of the Ashoka Trust for Re- 
search in Ecology and the Environment. 
Now, “whatever little we have remaining is 
under threat.” 

Analysts estimate nearly 200,000 square 
kilometers of forest will lose legal pro- 
tection under the bill. At particular risk, 
Shahabuddin and others say, are forests 
managed by local communities, which 
rarely enjoy formal recognition. The law 
will “ride roughshod” over the rights of 
people who live in and use these forests, 
the ecologists who signed last month’s let- 
ter forecast. For example, it “does not pro- 
vide any clarity” about how officials should 
consider existing land rights claims filed by 
Indigenous groups, says Pranav Menon, an 
anthropology graduate student at the Uni- 
versity of Minnesota and legal adviser to the 
Van Gujjar Tribal Yuva Sangathan, a youth- 
led Indigenous group. 

The law continues a legislative trend of 
“reducing people to just rubber stamps,” 
Shahabuddin says, noting that Parliament 
has taken other steps to weaken public par- 
ticipation in environmental decisions. 

India’s environment ministry argues the 
regulatory changes will support efforts 
to plant new forests and use trees, which 
absorb and store carbon, to fight climate 
change. And it asserts any loss of existing 
forests will be made up by creating new 
plantations elsewhere. 

But, “The devil is in the details,’ 
Chellam says, noting that if single-species 
plantations replace more diverse forests 
both biodiversity and the climate could suf- 
fer. “Functional, dynamic ecosystems will 
do a far better job of carbon sequestration 
than species-poor tree plantations,” he says. 

Once finalized, the new law will likely 
face legal challenges. “The Supreme Court 
will be flooded” by lawsuits questioning the 
law’s constitutionality, predicts Debadityo 
Sinha, a climate and ecosystems specialist 
at the Vidhi Centre for Legal Policy. One 
advocacy organization, the Environmental 
Support Group, is already arguing that the 
measure was “proposed in fundamental vio- 
lation” of rules requiring coordination with 
India’s Ministry of Tribal Affairs. 

In the meantime, researchers and conser- 
vationists are bracing for the worst. The pol- 
icy changes are “so short-sighted,” Chellam 
says. “Everyone is aghast, not just about 
their lives, but about the lives of future gen- 
erations of Indians.” 
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In its early days, enslaved Black laborers worked at the Catoctin Furnace ironworks, shown here in an 1890 photo. 


FORGING 


CONNECTIONS 


DNA from enslaved Black workers at a 
19th century iron forge links them to living descendants. 
But the research swirls with ethical questions 
By Andrew Curry 


anson Summers first appears in 

Maryland archives in 1834, when 

an inventory from Catoctin Fur- 

nace, an iron smelter in the ore- 

rich foothills of Maryland’s Blue 

Ridge Mountains, lists an enslaved 

17-year-old boy by the name of 

“Henson.” Fifteen years later, the 

furnace’s owners mention in a 

handwritten ledger that Henson was being 

sold for $500 to another iron smelter nearby. 

Then, in 1899, an obituary in the Hager- 

stown, Maryland, newspaper lauds Hanson 

Summers, age 82, who “was noted for his 

great strength and thought nothing of wheel- 

ing half a ton. ... He was employed at the An- 
tietam Iron Works.” 

Summers’s story of survival and iron- 

working skill was uncovered by genea- 
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logists and historians hoping to tell the story 
of Black workers at Catoctin Furnace, a hell- 
ish place centered on a three-story furnace 
kept burning for months at a time. Genea- 
logists from the Catoctin Furnace Historical 
Society tracked down Summers’s descen- 
dants, including his 86-year-old great-great- 
granddaughter, Agnes Jackson. On June- 
teenth this year, Jackson and her daughters 
drove to Catoctin for the first time from their 
homes in nearby Hagerstown. 

“We never knew we were connected to 
people here,’ Jackson says, sitting on a fold- 
ing chair in a small museum dedicated to the 
forge and its history. “[Our ancestors] were 
enslaved—it wasn’t a topic people wanted to 
talk about.” 

Last year, Jackson, who identifies as Af- 
rican American, also took a DNA test: She 


heck for 


a 


hoped to learn whether she and her daugh- 
ters are related to any furnace workers who 
were buried in a small cemetery near the site. 
For the first time, answering such a question 
may be possible, thanks to a new study of 
DNA of 27 people from that cemetery pub- 
lished on p. 500 (related Perspective, p. 482). 

Other analyses of the bones had already of- 
fered a glimpse of the harsh lives of enslaved 
people in an industrial setting. Now, research- 
ers have partnered with the consumer DNA 
testing company 23andMe to compare the 
ancient Catoctin Furnace genomes to those 
of almost 10 million living people in the com- 
pany’s database. They have identified 41,799 
relatives, including hundreds of potential di- 
rect descendants. But the company hasn’t yet 
notified any descendants about their results, 
as it works through the ethics of contacting 
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people who agreed to let their data be used 
in anonymized studies. At the same time, 
researchers are wary of traumatizing people 
unprepared for links to enslaved ancestors. 
Many in the Black community say such 
information—when collected with commu- 
nity consent—would have poignant value. 
“For marginalized communities whose his- 
tory has been obscured, this technology can 
be leveraged to tell their stories,’ says Jada 
Benn Torres, a Black biological anthropo- 
logist at Vanderbilt University who was not 
part of the new study. “There’s a beauty in 
connecting the past to the present.” 
“Connections to the past were severed be- 
cause of the trans-Atlantic slave trade,’ ex- 
plains Carter Clinton, a geneticist at North 
Carolina State University who is African 
American. (When possible, Science uses 


SCIENCE science.org 


sources’ own preferred identification, such as 
African American or Black.) “Today there’s a 
thirst for knowledge. How do we reconstruct 
those lives and find out where our ancestors 
came from, and what does that mean for how 
I identify today? It’s something we want and 
don’t have in comparison to other ethnicities 
in America.” 

But rapid advances in genetics and genea- 
logy open up hard questions about the power 
dynamics of studying the remains of enslaved 
people, such as who speaks for communities 
with no known genetic descendants. “This 
is one of the better executed projects I’ve 
seen in terms of the way they included the 
community and have been thoughtful about 
including Black American scholars,’ says 
Alexandra McDougle, a Black archaeologist 
at Columbia University. But, she says, “There's 


at the Catoctin Furnace - 
Ns “forge, stands:in the ruins of |” 
the owner's.mansion,” 


so much farther to go.” Some think the Cato- 
ctin researchers didn’t do enough to involve 
today’s Black community, for example. 
Clinton adds that linking DNA from his- 
toric remains to known living relatives “is 
significant for every African American in this 
country, and I’m rooting for a way to figure 
out how to make this happen ethically. The 
question is, who is the ultimate decision- 
maker” in making those connections? 


TODAY, CATOCTIN FURNACE SITS just off 
Maryland’s Highway 15, at the foot of the 
Blue Ridge Mountains 80 kilometers north of 
Washington, D.C. On a muggy but cool June 
morning, Elizabeth Comer pushes into the 
woods, headed for the cemetery. The trees are 
still dripping from showers the night before, 
and the thrum of cars nearby is constant. 
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Two hundred years ago, the noise around 
the cemetery would have been different—the 
hammering of iron, ringing of picks in an ad- 
jacent ore pit, and roar of a bellows-fired blast 
furnace—but just as oppressive. “It was never 
quiet,” says Comer, a white archaeologist 
and president of the Catoctin Furnace His- 
torical Society. “Tt still isn’t” 

As she picks her way through thick 
undergrowth, Comer, who grew up nearby, 
describes the furnace’s history starting in 
the 1760s, when it was founded to supply 
metal for the fast-growing American colo- 
nies. Catoctin churned out shells, cannon- 
balls, and kettles for the Continental Army 
in the Revolutionary War. After the war 
ended, production shifted to peacetime 
products such as stoves, pots and pans, 
and raw “pig iron.” 


or leased to other iron works, then gradually 
replaced by immigrants from Ireland, Eng- 
land, and Germany, who had to pay furnace 
owners for food and lodging. “There's this 
sudden replacement by free European labor, 
and it’s not clear where the Black community 
went,’ McDougle says. Twentieth century 
historians lauded the immigrants’ contri- 
butions but skipped over the pivotal role of 
Black workers. 

The cemetery was forgotten, too, until an 
archaeological survey ahead of highway con- 
struction revealed human remains. In 1979 
and 1980, a rescue excavation uncovered 
35 graves, 32 of which contained human re- 
mains. Some of the simple, uncarved white 
and gray stones used as grave markers still 
sit today in the woods where archaeologists 
set them aside more than 40 years ago. The 


Sharon Green, Vicki Winston, and Barbara Hart (left to right), at the ruins of the Catoctin Furnace site, are the 
great-great-great-granddaughters of one of its enslaved workers, Hanson Summers. 


When the furnace ran, a waterwheel 
turned a massive bellows, heating charcoal in 
a stone chimney, or “stack,” 10 meters high. 
Fires blazing for months at a time, the stack’s 
hunger for fuel stripped nearby slopes bare 
of trees. Dozens of workers and families lived 
and toiled under a constant pall of smoke. 

From its first days, the furnace’s white 
owners likely used Black labor for the hardest 
jobs, Comer says. Black workers were respon- 
sible for everything from keeping the massive 
furnace fires burning to channeling molten 
iron into sand molds. Work was constant. A 
German-speaking Moravian pastor, Frederick 
Schlegel, described conditions at the forge 
as “lamentable” in his diary after visiting 
in 1799. He recalled that enslaved workers 
crowded up to him and “complained how 
hard they work every day, including Sundays, 
to keep the iron flowing.” 

In the 1830s and 40s, many of the forge’s 
enslaved workers were sold, like Summers, 
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graves were dated to about 1800, and features 
of the bones showed they belonged to people 
of African ancestry. Maryland officials later 
transferred them to the Smithsonian Institu- 
tion, where they sat in storage for decades. 

In 2014, Comer started to search for lo- 
cal families with ties to the cemetery to help 
tell the site’s story. She was stymied by the 
lack of records: Dozens of people who lived 
at the forge in the early 1800s are remem- 
bered only as fleeting, first-name entries in 
inventories and diaries: Milly, Hercules, Bob, 
Clowy, Lusinda, Elizabeth, Andrew. Their 
stories and connections to living individu- 
als were lost; Comer was able to reconstruct 
Summers's story only because his first name 
was unusual. 

In that sense, Catoctin Furnace is far from 
unique. Until the census of 1870, U.S. records 
only rarely referred to enslaved Black people 
by full name. Enslaved families were often 
broken apart on purpose, splintering family 


trees. For Black people seeking their ances- 
tors, that created “a brick wall of slavery prior 
to 1870,” says historian Henry Louis Gates Jr. 
of Harvard University, a co-author on the 
new study, who is African American. To see 
beyond that wall, Comer, also a co-author, ap- 
plied for a grant to reanalyze the cemetery’s 
bones. She hoped new methods, including 
ancient DNA analysis, would tell a more com- 
plete story about Catoctin’s Black workers. 


DNA SAMPLING PRESENTED a chicken-or-egg 
ethical problem: Until scientists had DNA 
results from the unnamed individuals in 
the cemetery, they couldn’t get consent 
from genetic descendants to sample re- 
mains. So they applied a broader definition 
of “descendant,” expanding it to local Black 
community members, whom they asked 
about the study. 

This expansive concept of kinship, which 
originated with work in the 1990s at the 
New York African Burial Ground in lower 
Manhattan, has been applied in the hand- 
ful of other studies of the DNA of enslaved 
people. “It could be a community that’s not 
related but is residing in the region, who 
share a story and feel a responsibility for 
that enslaved community,’ explains Clinton, 
who did his Ph.D. work on the African Burial 
Ground, now a national monument. “Be- 
cause of these broken lineages, you can de- 
fine descendant in different ways—not just 
‘am I genetically related to you, but ‘are you 
invited to the cookout?” 

Comer and Smithsonian curators sought 
support from the African American Re- 
sources Cultural and Heritage (AARCH) 
Society, a nonprofit in Frederick, Maryland, 
focused on documenting the county’s Af- 
rican American history. AARCH Society 
members visited the Smithsonian lab in 
2016, and the group’s then-president sup- 
ported the studies, including drilling into 
the bones for DNA. “When this started I 
didn’t have a direct descendant commu- 
nity,’ Comer says. “Now, thank God, we’re 
getting one.” 

Before geneticists began to extract DNA, 
biological anthropologists at the Smithso- 
nian studied the human remains in an at- 
tempt to reconstruct life at the furnace. The 
bones offer testimony to conditions differ- 
ent from those at plantations, and at least 
as harsh. Nearly half of the 32 graves con- 
tained the remains of children under age 
4, some of whom had bowed lower legs— 
a classic symptom of vitamin D deficiency 
known as rickets. A smoky haze so thick 
it blocked the Maryland sun, much like a 
wildfire, likely reduced vitamin D produc- 
tion in skin. 

By analyzing heavy metals in the bones, 
the researchers were able to document in- 
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Catoctin’s living legacy 

Researchers used 23andMe’s massive genetic database to trace living people who share significant DNA with enslaved people buried in the 

19th century at Catoctin Furnace. Many descendants still live near Catoctin, with another cluster in California. (To preserve privacy, the company 
blurred the precise locations of participants, bumping some data outside U.S. borders.) 


10) 


oo0o0Q000 00 


Number of participants with 
significant match to Catoctin DNA 


dividual exposure to toxins. Some people 
had high levels of lead and zinc; one man 
in his late 40s had zinc levels nearly triple 
that of most enslaved people in the region. 
He might have worked as a “filler,” shoveling 
charcoal and zinc-rich iron ore into the blaz- 
ing furnace, Comer says. 

Other results set Catoctin apart, too. In the 
19th century, death usually claimed the very 
young or very old, or women in childbirth. But 
the 32 people excavated at Catoctin include 
five teenage boys. “Jobs that were considered 
menial labor were pretty dangerous, and if 
you weren’t skilled you were at risk from 
dying,” says co-author Kari Bruwelheide, 
a white biological anthropologist at the 
Smithsonian National Museum of Natural 
History (NMNH). “They were doing jobs 
they weren’t physically ready for.’ One boy 
who died at age 12 or 13 had a dislocated 
shoulder; a 15-year-old had herniated discs. 

One elderly man Bruwelheide examined 
had such severe spinal damage, he would 
have been bent nearly double when he died. 
An 1804 entry in Schlegel’s diary offers an 
uncanny reference that might be a match. 
“T then visited a cripple who walked all bent 
over,’ Schlegel wrote. “He lives in great need.” 
(Even the missionaries perpetuated identity 
loss for enslaved people: Schlegel and his 
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brethren omitted the names of the Black 
forge workers they baptized and buried, 
while dutifully recording white identities.) 

In 2017, geneticist David Reich of Harvard, 
who is white, and his colleagues joined the 
team. They sampled DNA from the remains 
and sequenced the whole genomes of 27 peo- 
ple, identifying five family groups of moth- 
ers and children within the cemetery. None 
included any adult male relatives, possibly a 
sign men and women were buried separately, 
in keeping with the Moravian missionaries’ 
traditions. Comparing the DNA in the Ca- 
toctin samples with databases of modern 
populations revealed that more people in the 
cemetery had European ancestry through 
the male than the female line, consistent 
with historic evidence of owners fathering 
children with enslaved Black women. 

Comer wondered whether scientists could 
link the unnamed bodies at Catoctin Furnace 
to living people. “By 2020, ancient DNA tech- 
nology was close to being able to find con- 
nections between people who lived in the 
past few hundred years and people today,” 
says co-author Eadaoin Harney, then a Ph.D. 
student in Reich’s lab and now a researcher 
at 23andMe. “We wanted to develop this 
technology and apply it to historic popula- 
tions where it would have the most impact.” 


Pennsylvania 


Catoctin 
” Furnace 


Maryland 
District of 
Columbia 


Virginia 


Deploying modified versions of the tools 
23andMe and other direct-to-consumer com- 
panies use to find genetic relatives, Harney 
identified stretches of DNA shared between 
people in company’s database and people in 
the cemetery, segments called “identical by 
descent” (IBD). “The more and longer seg- 
ments you share, the closer your relation- 
ship,’ Harney, who is white, explains. 

Most of the 41,799 people who matched 
had just a few, short IBD segments in com- 
mon with people from Catoctin Furnace. 
That suggests distant connections dating 
back centuries, to shared ancestors in Africa 
or Europe who lived generations before the 
cemetery was in use. 

Harney also identified 2975 living 
people in the 23andMe database who 
shared significantly higher amounts 
of DNA with the people from Catoctin 
Furnace. Some were estimated to be as 
little as five degrees removed—the equiva- 
lent of a_ great-great-great-grandparent, 
about right for a direct descendant 
225 years later. Because 23andMe’s data- 
base includes geographic data, Harney 
could even suggest where the cemetery’s 
descendants ended up. Many families didn’t 
go far—dozens of very close relatives still 
live in Maryland, and many more are spread 


4 AUGUST 2023 * VOL 381 ISSUE 6657 4&75 


NEWS FEATURES 


Members of the Summers 

family look at a photo of their 
great-great-grandfather Emory, the 
son of Hanson Summers, who 
once labored at Catoctin Furnace. 


across the southeastern United States (see 
map, p. 475). To protect participants’ pri- 
vacy, only employees of 23andMe know the 
exact locations. 

But for now, those descendants remain 
unaware of the connection. People in the 
23andMe database whose sequences were 
matched to those of the Catoctin workers 
signed a consent form with the company al- 
lowing use of their DNA for “a wide variety of 
research topics.” But the agreement promised 
their data would be anonymized and results 
would not be returned to them directly. The 
company is still working out how to ethically 
offer most people the possibility of being di- 
rectly connected to their forebears. 


SUMMERS IS NOT buried in the Catoctin cem- 
etery: He died more than half a century af- 
ter it fell into disuse. On Juneteenth, sitting 
on a folding chair in the museum, his great- 
great-granddaughter Jackson pondered the 
mystery of her family’s link to the site. 

Jackson is old enough to remember drink- 
ing from segregated water fountains and 
watching movies from the Black-only balcony 
of the local theater. But she doesn’t remember 
hearing her parents, farmers who lived on 
the outskirts of Sharpsburg, Maryland, ever 
talking about Catoctin as part of the family’s 
history. “We never knew we were connected 
to people here. This has taught me we didn’t 
know everything about our family.” 

For Jackson’s daughter Sharon Green 
and her sisters, the lack of documentation 
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as well as the horrors of slavery discour- 
aged deep genealogical research. “We only 
knew to go back to our grandparents. We 
didn’t know there was anything beyond 
that to be found,” Green says. 

Connecting the furnace workers to living 
descendants, as the new study aims to do, 
could be a way to address past injustices, 
says Fatimah Jackson, an African American 
geneticist at Howard University. If Catoctin 
descendants can be made aware of their DNA 
results, she says, people could “make connec- 
tions that were intentionally severed to pre- 
vent uprisings and community-building. It’s 
a way to say, ‘you didn’t win, we’re winning,” 

But others think the team behind the study 
could have done more to put descendants 
and their questions at the forefront. Indeed, 
spurred by the Black Lives Matter movement, 
a group of researchers in 2021 published a 
call to pause research on African American 
remains entirely until laws are passed to ad- 
dress consent and access to data. 

That appeal came well after the Catoctin 
research was underway, Still, the current 
AARCH Society leaders feel the consent given 
more than a decade ago is no substitute for 
broad community engagement. “[Scientists] 
need to build a relationship with the commu- 
nity, and it’s going to take more than a nod 
from AARCH to do that,” says the group’s cur- 
rent president, Protean Gibril. 

Rachel Watkins, a Black biocultural an- 
thropologist at American University who has 
worked with AARCH Society members and 


families in western Maryland, says securing 
support from the society to take samples was 
the bare minimum. More engagement should 
have followed the decision to restudy the 
bones, Watkins argues, but AARCH Society 
members were sidelined. “Based on current 
engagement models, merely showing results 
to communities isn’t adequate,’ Watkins says. 
The collaboration with 23andMe, for exam- 
ple, could have drawn on the AARCH Soci- 
ety’s genealogical knowledge about the local 
Black community to guide study design. 

Watkins also thinks the project could have 
worked harder to let descendants’ questions 
steer the study. Some African Americans in 
Maryland have in fact been able to break 
through the “brick wall” and trace their an- 
cestry back well before the 1870s, she says. 
The project could have asked those descen- 
dants what they’d like to know about the 
Catoctin people, Watkins says. “What they’ve 
discovered doesn’t necessarily address ques- 
tions people who have broken through the 
wall might have.” 

But Gates vigorously defends the project’s 
ethics and approach. “Rather than this being 
an example of exploitation of a vulnerable 
population, it is an example of the opposite: 
of deploying scientific tools to address ques- 
tions of long-standing interest to African 
Americans, and at the community’s request,’ 
Gates says. “The study is a tool for empower- 
ment of African Americans. I would not have 
joined the study or been so enthusiastic if 
that was not the case.” 


science.org SCIENCE 


PHOTO: AMERICA'S HIDDEN STORIES: FORGED IN SLAVERY/SMITHSONIAN CHANNEL, 


PHOTO: KINTSUGI KELLEY-CHUNG/AAAS 


Another point of tension is publication 
of the raw genetic data, which is typical in 
ancient DNA studies. Some historically mar- 
ginalized communities, however, fear the loss 
of control and privacy that comes with such 
publication. For example, when researchers 
obtained DNA from 36 people at the Anson 
Street African Burial Ground in Charleston, 
South Carolina, the research was published, 
but the DNA sequences were not deposited 
in open databases and any future research 
requires the local community’s permission. 

NMNH, where the Catoctin remains are 
housed, requires open access to data gener- 
ated from sampling its specimens. The data 
in the Science study were posted on the Eu- 
ropean Nucleotide Archive last year, allowing 
anyone to access it. Reich argues that in this 
case, open publication of data was important 
to help prevent commercial exploitation. “We 
didn’t want 23andMe to have exclusive ac- 
cess,” he says. “The only ethical approach is to 
make the historical data fully pub- 
licly available, so that the for-profit 
company cannot have proprietary 
ownership over data. This is par- 
ticularly important for DNA from 
enslaved individuals who did not 
have agency over their remains.” 

But open data can lead to ex- 
ploitation, too. In a development 
Reich calls “negative,” some com- 
panies are using the genes of 
enslaved people to tout genetic 
ancestry analyses, often without 
clear explanations of their meth- 
odology. History sells: Customers 
are lured by the chance of finding 
genetic links to Viking shield maid- 
ens, Roman gladiators—and, since 
their 27 genomes were posted on 
a preprint server last summer, en- 
slaved Black people from Catoctin 
Furnace. “Historical evidence shows Afri- 
can slaves were chosen with backgrounds 
in iron manufacturing,” the website of 
Switzerland-based MyTrueAncestry reads. 
“You can find out if they are part of your an- 
cestry. FIND OUT HOW” 

Comer keeps a spreadsheet tracking nearly 
200 hopeful inquiries she’s received since the 
company incorporated Catoctin Furnace 
into its marketing last summer. Based on 
what MyTrueAncestry has released about its 
methods, she thinks most inquirers are about 
as related to Catoctin as anyone with some 
African ancestry—very, very distantly. “Is 
that exploiting the desire within the African 
American community to have more informa- 
tion about their ancestry? Yes,” she says. “It’s 
given a lot of people false hope.” 

But Comer defends her project’s overall 
impact, saying it has helped build a Black 
descendant community at Catoctin Furnace 
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that otherwise wouldn’t exist. “I can’t say 
the project started with the community of 
descendants, because it didn’t;’ Comer says. 
“And it could have, had they been in a posi- 
tion to know the cemetery existed and these 
advances in archaeology existed.” 


ALTHOUGH MANY PEOPLE are eager to learn 
the stories of their enslaved ancestors, oth- 
ers might find the information traumatic, 
Comer and Reich say. They and their collab- 
orators note this concern in a commentary 
published this week in The American Jour- 
nal of Human Genetics. “A huge part of this 
research is the responsibility to disseminate 
information in a sensitive way and be pre- 
pared for the psychological impact it might 
have,’ Clinton says. 

Some descendants may be unaware of 
their links to slavery or even to the Black 
community. For example, at the moment 
there’s just one other Catoctin worker whose 


Distant cousins Crystal Claggett (left) and Steven Pilgrim both count a 
freed Black worker at Catoctin Furnace among their ancestors. 


descendants have been identified. In 1979, 
around the same time researchers were ex- 
cavating the Catoctin Furnace cemetery, 
Steven Pilgrim, a materials scientist at Al- 
fred University who identifies as white, made 
an unexpected discovery in Pennsylvania 
state archives. “The 1910 census shows my 
great-grandmother was Black,’ Pilgrim says. 
According to family lore, Manzella Grace 
Patterson died in childbirth in 1914, at age 38. 
Her husband remarried soon thereafter. 
Pilgrim kept digging, eventually trac- 
ing his great-grandmother’s ancestry back 
to a free Black ironworker named Robert 
Patterson, who lived near and labored at Ca- 
toctin Furnace in the mid-1800s. Pilgrim and 
other family members have since compiled a 
list of more than 1000 living Patterson rela- 
tives. Many have become part of the Catoctin 
Furnace community, participating in events 
on-site and getting briefed on the genetic 


research at the cemetery. However, the fam- 
ily may not be genetically related to anyone 
buried in the cemetery, which was no longer 
in use by the time Pattersons crop up in Ca- 
toctin records. 

Comer recognizes that not every family 
might welcome news of connection to the 
cemetery. “When I look at the 23andMe re- 
sults and see Maryland light up, I have a feel- 
ing I know some of these people—and they 
may not know, or want to know, they’e re- 
lated to skilled African American ironwork- 
ers,” she says. “That’s the American story, and 
that’s the story here.” 

As researchers think through the ethics 
of contacting descendants, medical research 
can provide a model for consent, suggests 
Alondra Nelson, a sociologist of science at 
the Institute for Advanced Study, who is Afri- 
can American. “If you’re doing genome-wide 
analysis and find someone has a disease, 
some people want to know, others don’t,’ 
she says. The key is to ask before 
doing the study. If people didn’t 
agree to learn about genetic con- 
nections with historic individu- 
als when they submitted their 
DNA to 23andMe, it might not 
be ethical to contact them now, 
she says. 

Clinton agrees, and also thinks 
23andMe’s clients should have 
been able to specifically consent— 
or decline—to have their DNA used 
in work on historic people. “People 
should be fully aware of how their 
data is being used, especially if it’s 
being used downstream in other 
ways that are not directly benefi- 
cial to them,” he says. 

Agnes Jackson and her daugh- 
ters, the known descendants of 
Summers, with Pilgrim and other 
Patterson family members, were briefed on 
the overall DNA results in a video call with 
Comer, Harney, and Reich in early June. 
For them, more answers are on the way: 
23andMe said in a statement that it plans 
to return results to the two documented 
descendant families, with the help of the 
Catoctin Furnace Historical Society and the 
Smithsonian. 

As the afternoon of Juneteenth shaded 
into dusk, Jackson’s daughter Green looked 
around the Catoctin Furnace museum. For 
now, she says, the realization that their an- 
cestor was a metalworker has changed the 
way she views her past. “They always made 
it out that we were so uneducated and insig- 
nificant, that our past began with cotton and 
ended with cotton,’ she says. “When I look at 
the price they paid for him, that skill set—it 
blows me away. We were educated people, 
with valuable skill sets.” m 
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Community-initiated genomics 


Ancient DNA is used to connect enslaved African 
Americans to modern descendants 


By Fatimah L. C. Jackson 


pproximately 200 years ago, a largely 
enslaved African American  work- 
force was replaced by paid European 
workers at Catoctin Iron Furnace in 
Maryland, USA. The life histories of 
the uncompensated African American 
workers are largely unknown. On page 500 
of this issue Harney et al. (1) describe the use 
of ancient DNA analysis contextualized by a 
comprehensive history to reveal multigen- 
erational connections between colonial and 
modern-day African Americans. Both genetic 
and genealogical aspects of the study were 
initiated by informed community engage- 
ment and consent. Through these findings, 
the voices of these African American work- 
ers have begun to be heard, and the sad irony 
of their situation in producing ammunition 
for the American Patriots while the British 
offered them emancipation during the 
Revolutionary War is highlighted. 

The enslaved African American workers 
at Catoctin Iron Furnace generated approxi- 
mately 10 tons of bombshells for General 
George Washington’s Continental Army (1775- 
1783). Access to this ammunition proved criti- 
cal during the siege of Yorktown, Virginia, in 
1781. The battle at Yorktown was the last ma- 
jor land battle of the American Revolutionary 
War in North America and led to the surren- 
der of British General Charles Cornwallis and 
the capture of both him and his army. This 
prompted the British government to negoti- 
ate an end to the conflict, ultimately resulting 
in an American victory. Ironically, the British 
had promised freedom to enslaved African 


Howard University and QuadGrid Data Lab, Washington, 
DC, USA. Email: fatimah.jackson@howard.edu 
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Americans, and tens of thousands had es- 
caped to the British side just before and dur- 
ing the Revolutionary War. After the crushing 
defeat of the British at Yorktown, enslaved 
people were recaptured from the British 
and returned to their American “owners,” in- 
cluding George Washington. However, over 
50,000 formerly enslaved African Americans 
left North America with the British army in 
1783 (2). Slavery was eventually abolished 
in the US in 1865 but, if the British had won 
the Revolutionary War and kept true to the 
promise of emancipation, a large segment of 
enslaved African Americans would have pos- 
sibly become free nearly 100 years or five gen- 
erations earlier. 

The investigation by Harney et al. was per- 
formed because members of the modern-day 
community in Catoctin wanted to know more 
about the African American furnace workers. 
Areas of interest included where these work- 
ers came from in Africa, where they had gone 
after being freed from Catoctin, and to whom 
they were related. Community members from 
the African American Resources Cultural and 
Heritage Society of Frederick County con- 
tacted experts at the Smithsonian Institution 
and raised USD 14,000 to fund research to re- 
construct the lives of these African American 
ancestors. As a result, Harney e¢ al. retrieved 
DNA samples from skeletal material of 27 en- 
slaved and free colonial African Americans 
buried in Catoctin Iron Furnace. 

They compared the genetic data from 
these 27 colonial African Americans (rep- 
resenting five genetically distinct families) 
with current data from a commercial data- 
base not known to have high proportions 
of predominantly African American partici- 
pants (3) and identified nearly 42,000 living 
descendants throughout the United States. 


Agnes Jackson, Barbara Hart, Vicki Winston, and SI 
Green, descendants of an enslaved worker at Catocun, 
visit the site of the furnace for the first time. 


Notably, a large group of genetically related 
descendants was found to reside 2970 miles 
away in southern California. Many of the 27 
historic Catoctin workers were found to be 
from the contemporary regions of Senegal 
and Gambia in West Africa and Angola and 
the Democratic Republic of the Congo in 
Central Africa. This geographical range of 
ancestry highlights the opportunistic nature 
of the trade in enslaved Africans and the het- 
erogeneity of its survivors. 

The large numbers of genetic descen- 
dants from Catoctin and their subsequent 
wide geographical distribution are infor- 
mative in several important ways: that the 
will to survive and have children was strong 
among these colonized African Americans; 
the desire to thrive through relocation 
was persistent; and contemporary African 
Americans are likely more closely related 
to each other than had been previously 
surmised. The current 42 million Legacy 
African Americans (who have a legacy of 
enslavement and institutionalized segrega- 
tion) are derived from a proportionately 
small number of African founders including 
less than 100,000 women (4). 

Ensnarement in the centuries of American 
chattel slavery was far worse than has been 
thought (5). The psychological and physi- 
ological tolls are still apparent generations 
later in the epigenomes (6), and possibly the 
microbiomes (7), of descendants. In addition 
to the alienation associated with enslave- 
ment, there is a sense of anomie and disas- 
sociation through the loss of personal iden- 
tity that is linked to the physical brutality of 
enslavement (8). The dream of the enslaved 
to be free is powerful. Equally strong was the 
desire to be united with one’s family, to be 
reconnected to one’s offspring, to have de- 
scendants, to recognize one’s ancestors, and 
to belong to a group of substance and mean- 
ing. Accurate and comprehensive genomics 
has the capacity to liberate contemporary de- 
scendants and honor ancestors. The research 
from Catoctin Iron Furnace opens the gates 
for an entirely new way of bringing the ben- 
efits of genomics to the masses and convinc- 
ing them of its relevance. 

In the past, research involving African 
American individuals has been top-down 
and researcher initiated, which means it 
was done to, rather than with, the African 
American community. This traditional ap- 
proach has contributed to the low accep- 
tance of genetic information in the African 
American community and the hesitancy of- 
ten expressed about the utility and veracity 
of genetic markers of disease and ancestry. 
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What makes the work of Harney e¢ al. so 
pioneering is that the research was initiated 
by an engaged local community of African 
Americans and results were structured to 
meet the needs, priorities, and sensibilities 
of the larger African American community. 
This is the way that this type of research 
should be performed, and it provides a 
blueprint for future studies. 

The genetics of diseases and disorders 
that contribute to health disparities among 
African Americans have been understud- 
ied, and even fewer studies have connected 
genetic disease and ancestry in this popula- 
tion. Given the centrality of community en- 
gagement to the study of Harney et al., it is 
disappointing that the authors did not focus 
on the genetics of chronic complex diseases, 
such as cancer (9), high blood pressure (10), 
stroke (12), and diabetes (12), which dispro- 
portionately kill and disable members of the 
African American community. These diseases 
have a genetic component as well as impor- 
tant environmental aspects, and are of public 
health importance. Providing such applied 
genetic information on common complex 
diseases to the Catoctin Iron Furnace descen- 
dants (and by extension to all Legacy African 
Americans owing to the reported small 
number of female founders) would have em- 
phasized the relevance of genetic studies to 
the physical health as well as psychological 
well-being of the African American com- 
munity. Investigating the issues of greatest 
importance to the African American com- 
munity will be essential as research contin- 
ues to examine the biological histories of this 
population. Steps toward increasing African 
American participation in clinical trials, 
pharmacogenomic trials, and future ances- 
tral genetic studies include training more 
African American researchers in the scien- 
tific methods used and ensuring that African 
American individuals are contributing to all 
aspects of such projects. 
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Gut epithelium modulates 
fungal pathogenesis 


Specialized epithelium secretes an antifungal peptide 


By Woan-Yu Lin*?3 and lliyan D. Iliev:234 


ntestinal epithelial cells (IECs) line the 

inner surface of the intestinal tract and 

play crucial roles in digestion, absorp- 

tion, and defense. By interacting with gut 

microbiota, IECs are essential for main- 

taining gut homeostasis and regulating 
immune responses. On page 502 of this is- 
sue, Pierre et al. (1) report that Paneth cells 
in the intestinal epithelium secrete peptide 
YY (PYY,, ;)» which has antifungal activity in 
the gastrointestinal (GI) tract of mice. This 
is in contrast to the smaller PYY, ,, peptide 
that is secreted by intestinal enteroendocrine 
cells (EECs) and functions as an appetite hor- 
mone. These findings demonstrate that intes- 
tinal peptides can adopt different properties 
through spatial-specific secretion and diver- 
sified proteolytic cleavage. 

IECs have various functions to maintain 
intestinal homeostasis. Goblet cells produce 
the protective mucus layer, and EECs se- 
crete several hormones into the circulation 
in response to stimuli in the GI tract. Paneth 
cells sense microbes (2) and produce anti- 
microbial peptides (AMPs) to directly kill 


or inhibit a range of intestinal microbes (3). 
Intraepithelial lymphocytes and basolaterally 
positioned immune cells form a multilayered 
defense system. 

In Paneth cells, dysfunctional autophagy- 
related 16-like 1 (ATGI6L1) led to impaired 
autophagy, altered granule exocytosis, and 
reduced production of AMPs in mice (4). 
Deletion of X-box-binding protein 1 (XBP1)—a 
transcription factor of the unfolded protein 
response—in Paneth cells led to endoplasmic 
reticulum (ER) stress and caused increased 
cell death in the intestinal crypts of mice. 
ATGI6L1 deficiency together with XBP1 im- 
pairment further result in severe spontane- 
ous Crohn’s disease—like ileitis in mice (5). 
Therefore, malfunctioning Paneth cells in- 
crease susceptibility to intestinal pathologies, 
primarily as a result of the impairment of 
packaging and secretion of AMPs (4, 6). 

Imbalance (dysbiosis) of fungal microbi- 
ota has been associated with inflammatory 
bowel disease. During intestinal disease, one 
prominent opportunistic fungus, Candida 
albicans, damages macrophages and intesti- 
nal epithelium (7, 8) and further aggravates 
intestinal inflammation through a hyphae- 


Gut mycobiota regulation by Paneth cells 


The gut epithelium exhibits various strategies to regulate the mycobiome, such as the secretion of AMPs, mucins, 


and slgA. The secretory function of Paneth cells is dependent on ATG16L1- and XBP1-dependent pathways. 
These cells also secrete PYY,.3., which exhibits selective antifungal activity against Candida albicans hyphal 
morphotypes in the gut. Whether PYY,.,, secretion is also regulated by the cytokines IL-17 and IL-22 is unknown. 
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pioneering is that the research was initiated 
by an engaged local community of African 
Americans and results were structured to 
meet the needs, priorities, and sensibilities 
of the larger African American community. 
This is the way that this type of research 
should be performed, and it provides a 
blueprint for future studies. 

The genetics of diseases and disorders 
that contribute to health disparities among 
African Americans have been understud- 
ied, and even fewer studies have connected 
genetic disease and ancestry in this popula- 
tion. Given the centrality of community en- 
gagement to the study of Harney et al., it is 
disappointing that the authors did not focus 
on the genetics of chronic complex diseases, 
such as cancer (9), high blood pressure (10), 
stroke (12), and diabetes (12), which dispro- 
portionately kill and disable members of the 
African American community. These diseases 
have a genetic component as well as impor- 
tant environmental aspects, and are of public 
health importance. Providing such applied 
genetic information on common complex 
diseases to the Catoctin Iron Furnace descen- 
dants (and by extension to all Legacy African 
Americans owing to the reported small 
number of female founders) would have em- 
phasized the relevance of genetic studies to 
the physical health as well as psychological 
well-being of the African American com- 
munity. Investigating the issues of greatest 
importance to the African American com- 
munity will be essential as research contin- 
ues to examine the biological histories of this 
population. Steps toward increasing African 
American participation in clinical trials, 
pharmacogenomic trials, and future ances- 
tral genetic studies include training more 
African American researchers in the scien- 
tific methods used and ensuring that African 
American individuals are contributing to all 
aspects of such projects. 
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ntestinal epithelial cells (IECs) line the 

inner surface of the intestinal tract and 

play crucial roles in digestion, absorp- 

tion, and defense. By interacting with gut 

microbiota, IECs are essential for main- 

taining gut homeostasis and regulating 
immune responses. On page 502 of this is- 
sue, Pierre et al. (1) report that Paneth cells 
in the intestinal epithelium secrete peptide 
YY (PYY,, ;)» which has antifungal activity in 
the gastrointestinal (GI) tract of mice. This 
is in contrast to the smaller PYY, ,, peptide 
that is secreted by intestinal enteroendocrine 
cells (EECs) and functions as an appetite hor- 
mone. These findings demonstrate that intes- 
tinal peptides can adopt different properties 
through spatial-specific secretion and diver- 
sified proteolytic cleavage. 

IECs have various functions to maintain 
intestinal homeostasis. Goblet cells produce 
the protective mucus layer, and EECs se- 
crete several hormones into the circulation 
in response to stimuli in the GI tract. Paneth 
cells sense microbes (2) and produce anti- 
microbial peptides (AMPs) to directly kill 


or inhibit a range of intestinal microbes (3). 
Intraepithelial lymphocytes and basolaterally 
positioned immune cells form a multilayered 
defense system. 

In Paneth cells, dysfunctional autophagy- 
related 16-like 1 (ATGI6L1) led to impaired 
autophagy, altered granule exocytosis, and 
reduced production of AMPs in mice (4). 
Deletion of X-box-binding protein 1 (XBP1)—a 
transcription factor of the unfolded protein 
response—in Paneth cells led to endoplasmic 
reticulum (ER) stress and caused increased 
cell death in the intestinal crypts of mice. 
ATGI6L1 deficiency together with XBP1 im- 
pairment further result in severe spontane- 
ous Crohn’s disease—like ileitis in mice (5). 
Therefore, malfunctioning Paneth cells in- 
crease susceptibility to intestinal pathologies, 
primarily as a result of the impairment of 
packaging and secretion of AMPs (4, 6). 

Imbalance (dysbiosis) of fungal microbi- 
ota has been associated with inflammatory 
bowel disease. During intestinal disease, one 
prominent opportunistic fungus, Candida 
albicans, damages macrophages and intesti- 
nal epithelium (7, 8) and further aggravates 
intestinal inflammation through a hyphae- 
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and slgA. The secretory function of Paneth cells is dependent on ATG16L1- and XBP1-dependent pathways. 
These cells also secrete PYY,.3., which exhibits selective antifungal activity against Candida albicans hyphal 
morphotypes in the gut. Whether PYY,.,, secretion is also regulated by the cytokines IL-17 and IL-22 is unknown. 
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produced toxin called candidalysin in a strain- 
dependent manner (7). Several mechanisms 
exist to limit C. albicans pathogenicity in the 
GI tract. Secretory immunoglobulin A (sIgA), 
which is secreted into the intestinal lumen, 
mediates regulation of C. albicans hyphal 
growth and shapes the gut mycobiota dur- 
ing homeostasis (9, 10), but this regulation 
is impaired in Crohn’s disease (9). In addi- 
tion, secreted mucin O-glycans (which form 
the mucus layer) suppress C. albicans hyphal 
morphogenesis by inducing the expression 
of NRGI1 (J), a transcriptional repressor of 
filamentation. AMPs such as LL-37 and B- 
defensin 3 promote resistance to C. albicans 
colonization in the noninflamed gut or the 
oral mucosa (12, 13). 

Pierre et al. made the unexpected observa- 
tion that Paneth cells in both the mouse and 
human small intestinal epithelium produce 
PYY,,,- The shorter version, PYY,,,, gener- 
ated after N-terminal proteolysis of PYY,,., 
is an appetite control hormone produced by 
EECs, so the presence of PYY,,, in Paneth 
cells was unexpected. PYY,,, exhibited 
broad antimicrobial activity against several 
Candida species and less-pronounced activ- 
ity against diverse bacterial species (see the 
figure). Consistent with previous findings 
that highlight the yeast-to-hyphae transition 
as a key virulence process in C. albicans (7- 
IZ), the authors found that PYY,,, mitigates 
fungal pathogenesis by influencing hyphal 
formation. Specificially, PYY,,, induces con- 
centration-dependent membrane permeabili- 
zation and disruption of hyphal growth, with 
less impact on yeast morphotypes. Cationic 
PYY,,, interacts with the anionic surface 
charge of fungal hyphae, further elucidating 
its mechanism of action. 

Using ex vivo mouse distal ileal loops ex- 
posed to C. albicans, Pierre et al. discovered 
that PYY,,, is retained in the mucus layer 
owing to its charge affinity, which protects it 
from enzymatic cleavage into PYY, ,.. In mice 
with prominent C. albicans gut colonization 
as a result of antibiotic treatment, admin- 
istration of PYY,,, reduced C. albicans bur- 
den. Analysis of the mucus layer of the gut 
revealed increased C. albicans abundance in 
PYY-deficient mice compared with wild-type 
mice, highlighting the role of PYY, ,, in mod- 
ulating gut fungal colonization. Overall, this 
study reveals the antimicrobial properties of 
PYY,,, against C. albicans and provides valu- 
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able insights into how IECs contribute to gut 
mycobiota regulation. 

XBP1 and ER stress in IECs promote 
sIgA secretion, which prevents patho- 
gen access to the epithelial surface of the 
GI tract (14). How molecular mechanisms, 
such as ATGI16LI-dependent autophagy 
and XBPI-regulated unfolded protein re- 
sponse, mediate Paneth cell secretory 
functions and might directly or indirectly 
affect the secretion of PYY,,, remain to 
be determined. Interleukin-17 (IL-17) is 
critical for antifungal mucosal immunity 
and induction of AMPs (13). In the gut, 
C. albicans colonization promotes T helper 
17 (T,,17) immune responses through the 
interaction with CX3C chemokine recep- 
tor 1 (CX3CR1)-expressing mononuclear 
phagocytes (MNPs), which express several 
antifungal receptors (15). Whether PYY_,, 
secretion is induced directly by C. albicans 
or indirectly through CX3CR1* MNPs and 
IL-17 requires further investigation. 

Although Pierre et al. demonstrate the in 
vitro inhibition of C. albicans hyphal forma- 
tion, it remains unclear whether PYY,,, di- 
rectly affects the yeast-to-hyphae program or 
whether this is a function of C. albicans im- 
paired growth upon exposure to the peptide. 
Furthermore, because bacteria do not pro- 
duce hyphae, it is unclear how PYY,,, exerts 
antibacterial effects. Investigation into the 
cellular targets of PYY,,, in fungi and bacte- 
ria could help elucidate differential efficacy. 
Given the observation by Pierre et al. of bacte- 
rial composition alterations in PYY-deficient 
mice, the indirect suppression of hyphal 
morphogenesis through interkingdom inter- 
actions could also contribute. Nonetheless, 
understanding the evolution that led to the 
production of such antifungal molecules will 
enrich our knowledge of the intricate host- 
microbe interactions at the mucosal surfaces 
of the GI tract. 


REFERENCES AND NOTES 


1. J.F.Pierre et al., Science 381, 502 (2023). 
2. S.Vaishnavaetal., Proc. Natl. Acad. Sci. U.S.A.105, 20858 
(2008). 
. N.H.Salzmanetal., Nat. !mmunol. 11,76 (2010). 
. K.Cadwelletal., Nature 456, 259 (2008). 
. T.E.Adolphetal., Nature 503, 272 (2013). 
. T.S.Stappenbeck, D. P.B. McGovern, Gastroenterology 
152, 322 (2017). 
7. X.V.Lietal., Nature 603, 672 (2022). 
8. L.Kasper etal., Nat. Commun.9, 4260 (2018). 
9. |.Doronetal., Nat. Microbiol. 6, 1493 (2021). 
10. K.S.Ostetal., Nature 596, 114 (2021). 
ll. J. Takagietal., Nat. Cher. Biol.18, 762 (2022). 
12. D.Fanetal., Nat. Med. 21,808 (2015). 
13. H.R.Contietal., Cell Host Microbe 20, 606 (2016). 
14. J.Grootjans etal., Science 363, 993 (2019). 
15. |.Leonardietal., Science 359, 232 (2018). 


ao Ww 


ACKNOWLEDGMENTS 


The authors were supported by National Institutes of Health 
grants (DK113136, DK121977, and Al163007), a Burroughs 
Wellcome Fund PATH award, and a CIFAR award. 


10.1126/science.adj1240 


MATERIALS SCIENCE 


Ferroelectric 
polymers take 
a step toward 
bioelectronics 


Chemical modification 
opens new applications 
for polymers in wearables 


By Han-Yue Zhang? and Ren-Gen Xiong!” 


erroelectrics are electroactive materi- 

als whose spontaneous polarization 

can be reoriented under external 

electric stimuli, and they have wide 

applications including in infrared 

detectors and sensors in wearable 
electronics (1). Ferroelectric materials for 
wearable and bioelectronic devices should 
be soft, pliable, and elastic to be com- 
patible with the human body (2). These 
requirements impose challenges for con- 
ventional ferroelectric materials such as 
inorganic oxides, which are generally rigid 
and brittle (3). Ferroelectric polymers 
have advantages such as being mechani- 
cally flexible and lightweight. However, 
poly(vinylidene difluoride) (PVDF), the 
dominant ferroelectric polymer, tends to 
exhibit unrecoverable plastic deformation 
(4). On page 540 of this issue, Gao et al. 
(5) report a PVDF-based, inherently elas- 
tic ferroelectric that is achieved by partial 
cross-linking—a process linking two poly- 
mer chains through covalent bonds. The 
polymer shows stable ferroelectricity even 
under a large mechanical deformation, 
enabling applications where elasticity is 
required. 

Ferroelectricity mainly depends on long- 
range polar order in crystalline materials 
and polymers’ crystalline regions, where 
atoms, molecules, or ions present long- 
range positional ordering. This means that 
good ferroelectric performance requires 
high crystallinity. The first ferroelectric 
Rochelle salt (potassium sodium L-tar- 
trate tetrahydrate), a classical chiral ionic 
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produced toxin called candidalysin in a strain- 
dependent manner (7). Several mechanisms 
exist to limit C. albicans pathogenicity in the 
GI tract. Secretory immunoglobulin A (sIgA), 
which is secreted into the intestinal lumen, 
mediates regulation of C. albicans hyphal 
growth and shapes the gut mycobiota dur- 
ing homeostasis (9, 10), but this regulation 
is impaired in Crohn’s disease (9). In addi- 
tion, secreted mucin O-glycans (which form 
the mucus layer) suppress C. albicans hyphal 
morphogenesis by inducing the expression 
of NRGI1 (J), a transcriptional repressor of 
filamentation. AMPs such as LL-37 and B- 
defensin 3 promote resistance to C. albicans 
colonization in the noninflamed gut or the 
oral mucosa (12, 13). 

Pierre et al. made the unexpected observa- 
tion that Paneth cells in both the mouse and 
human small intestinal epithelium produce 
PYY,,,- The shorter version, PYY,,,, gener- 
ated after N-terminal proteolysis of PYY,,., 
is an appetite control hormone produced by 
EECs, so the presence of PYY,,, in Paneth 
cells was unexpected. PYY,,, exhibited 
broad antimicrobial activity against several 
Candida species and less-pronounced activ- 
ity against diverse bacterial species (see the 
figure). Consistent with previous findings 
that highlight the yeast-to-hyphae transition 
as a key virulence process in C. albicans (7- 
IZ), the authors found that PYY,,, mitigates 
fungal pathogenesis by influencing hyphal 
formation. Specificially, PYY,,, induces con- 
centration-dependent membrane permeabili- 
zation and disruption of hyphal growth, with 
less impact on yeast morphotypes. Cationic 
PYY,,, interacts with the anionic surface 
charge of fungal hyphae, further elucidating 
its mechanism of action. 

Using ex vivo mouse distal ileal loops ex- 
posed to C. albicans, Pierre et al. discovered 
that PYY,,, is retained in the mucus layer 
owing to its charge affinity, which protects it 
from enzymatic cleavage into PYY, ,.. In mice 
with prominent C. albicans gut colonization 
as a result of antibiotic treatment, admin- 
istration of PYY,,, reduced C. albicans bur- 
den. Analysis of the mucus layer of the gut 
revealed increased C. albicans abundance in 
PYY-deficient mice compared with wild-type 
mice, highlighting the role of PYY, ,, in mod- 
ulating gut fungal colonization. Overall, this 
study reveals the antimicrobial properties of 
PYY,,, against C. albicans and provides valu- 
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able insights into how IECs contribute to gut 
mycobiota regulation. 

XBP1 and ER stress in IECs promote 
sIgA secretion, which prevents patho- 
gen access to the epithelial surface of the 
GI tract (14). How molecular mechanisms, 
such as ATGI16LI-dependent autophagy 
and XBPI-regulated unfolded protein re- 
sponse, mediate Paneth cell secretory 
functions and might directly or indirectly 
affect the secretion of PYY,,, remain to 
be determined. Interleukin-17 (IL-17) is 
critical for antifungal mucosal immunity 
and induction of AMPs (13). In the gut, 
C. albicans colonization promotes T helper 
17 (T,,17) immune responses through the 
interaction with CX3C chemokine recep- 
tor 1 (CX3CR1)-expressing mononuclear 
phagocytes (MNPs), which express several 
antifungal receptors (15). Whether PYY_,, 
secretion is induced directly by C. albicans 
or indirectly through CX3CR1* MNPs and 
IL-17 requires further investigation. 

Although Pierre et al. demonstrate the in 
vitro inhibition of C. albicans hyphal forma- 
tion, it remains unclear whether PYY,,, di- 
rectly affects the yeast-to-hyphae program or 
whether this is a function of C. albicans im- 
paired growth upon exposure to the peptide. 
Furthermore, because bacteria do not pro- 
duce hyphae, it is unclear how PYY,,, exerts 
antibacterial effects. Investigation into the 
cellular targets of PYY,,, in fungi and bacte- 
ria could help elucidate differential efficacy. 
Given the observation by Pierre et al. of bacte- 
rial composition alterations in PYY-deficient 
mice, the indirect suppression of hyphal 
morphogenesis through interkingdom inter- 
actions could also contribute. Nonetheless, 
understanding the evolution that led to the 
production of such antifungal molecules will 
enrich our knowledge of the intricate host- 
microbe interactions at the mucosal surfaces 
of the GI tract. 
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Ferroelectric 
polymers take 
a step toward 
bioelectronics 


Chemical modification 
opens new applications 
for polymers in wearables 


By Han-Yue Zhang? and Ren-Gen Xiong"? 


erroelectrics are electroactive materi- 

als whose spontaneous polarization 

can be reoriented under external 

electric stimuli, and they have wide 

applications including in infrared 

detectors and sensors in wearable 
electronics (1). Ferroelectric materials for 
wearable and bioelectronic devices should 
be soft, pliable, and elastic to be com- 
patible with the human body (2). These 
requirements impose challenges for con- 
ventional ferroelectric materials such as 
inorganic oxides, which are generally rigid 
and brittle (3). Ferroelectric polymers 
have advantages such as being mechani- 
cally flexible and lightweight. However, 
poly(vinylidene difluoride) (PVDF), the 
dominant ferroelectric polymer, tends to 
exhibit unrecoverable plastic deformation 
(4). On page 540 of this issue, Gao et al. 
(5) report a PVDF-based, inherently elas- 
tic ferroelectric that is achieved by partial 
cross-linking—a process linking two poly- 
mer chains through covalent bonds. The 
polymer shows stable ferroelectricity even 
under a large mechanical deformation, 
enabling applications where elasticity is 
required. 

Ferroelectricity mainly depends on long- 
range polar order in crystalline materials 
and polymers’ crystalline regions, where 
atoms, molecules, or ions present long- 
range positional ordering. This means that 
good ferroelectric performance requires 
high crystallinity. The first ferroelectric 
Rochelle salt (potassium sodium L-tar- 
trate tetrahydrate), a classical chiral ionic 
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Ferroelectric polymer achieves elastic recovery 

Achemical modification has been developed to confer elasticity to the ferroelectric polymer poly(vinylidene 
fluoride-trifluoroethylene) [P(VDF-TrFE)] by adding the cross-linker polyethylene glycol (PEG) diamine. The 
partially cross-linked P(VDF-TrFE) films largely retain the polar 8 phase, exhibiting elastic recovery and robust 
ferroelectricity under a strain that stretches the film's length by 70%. 


Recoverable 


fF Fok a F Partially cross-linked polymer 
POERO, ttkthebit a 

F N FFF F N - } } 
P(VDF-TrFE) plastic chain ; | | 

+  =o—————— Elastic network Peace 

) ) | 
H 0) H —————» 
od box r As v $e 
N N 


PEG diamine soft chain 


x,y,m, and n are the number of repeat 
units of cross-linked P(VDF-TrFE); q is x nx 
the number of repeat units in PEG diamine. F 


a 


FoR F 


} 
F i 


— / 


[ £9 


crystal, was discovered in 1920 (6). Inor- 
ganic oxide perovskite ferroelectric ceram- 
ics such as barium titanate (BaTiO,) and 
lead-zirconium titanate [Pb(Zr,Ti)O,] then 
emerged with a wide range of applications 
such as ultrasound imaging (7). These ma- 
terials are rigid and brittle, and suffer frac- 
ture under mechanical deformation. They 
show elastic tensile strain—a type of strain 
that is recoverable under stretching—be- 
low 0.2%, meaning the material can only 
be stretched 0.2% if it is to recover its orig- 
inal shape (8). Materials based on PVDF, 
whose ferroelectric behavior was first dis- 
covered in the 1970s, are semi-crystalline 
polymers (9). In general, they exhibit good 
plasticity—they change shape permanently 
in response to an applied 
force—but poor elasticity— 
they are unable to return to 
their original form when the 
force is removed. They have 
a small elastic tensile strain 
of less than 2% (8). 
Materials with good elas- 
ticity, such as rubber, can 
be found everywhere from 
natural products to synthetic 
materials. They are usually 
amorphous polymers with a low glass tran- 
sition temperature (Tp the temperature at 
which the polymers transition from a rigid 
“glassy” state to a flexible “rubbery” state) 
(10). Likewise, the crystallinity of polymers 
should be relatively low to achieve a good 
elastic recovery. In the past few years, 
chemical cross-linking has gained wide- 
spread attention as a simple and effective 
method to obtain elastic ferroelectrics (1). 
Nevertheless, conventional chemical cross- 
linking severely decreases the polymer’s 
crystallinity, which in turn might weaken 
the ferroelectric response. It is thus a chal- 
lenge to synergistically maintain both good 
ferroelectricity and high elastic resilience 
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“ «chemical 
modifications can 
‘tune’ the plastic 

ferroelectric 
polymer into an 
elastomer...’ 


in one material, which is important for ap- 
plications in wearable electronic devices. 

PVDF has five different crystalline phases 
(a, B, y, 6, and e). Among them, the polar 
6 phase shows the highest ferroelectric re- 
sponse because of the well-oriented polar 
structure of the polymer chain, which is in 
an all-trans conformation (72). The addi- 
tion of bulky trifluoroethylene (TrFE) units 
into PVDF favors the formation of the po- 
lar 8 phase in the copolymer P(VDF-TrFE), 
which results in enhanced ferroelectricity 
(13). This suggests that inducing the for- 
mation of the polar B phase is essential for 
improving ferroelectricity in PVDF-based 
ferroelectric polymers. In this sense, the 
elasticity of ferroelectric polymers could be 
realized by a chemical modifi- 
cation that substantially pre- 
serves the B phase. 

Gao et al. developed a low- 
density strategy whereby 1 
to 2% of the total repeating 
units of the polymer chains 
are cross-linked to improve 
the elastic performance of the 
ferroelectric polymer P(VDF- 
TrFE). This partial cross- 
linking using the soft- and 
long-chain cross-linker polyethylene glycol 
(PEG) diamine imparts elastic resilience and 
high crystallinity to achieve a good ferro- 
electric response. The resulting intrinsically 
elastic ferroelectric polymer exhibits excel- 
lent elastic recovery with an elastic tensile 
strain as large as 125% (see the figure), which 
is at least one order of magnitude larger than 
previously reported in PVDF-based polymers 
(8). It also shows a robust ferroelectric re- 
sponse even up to 70% strain, that is, where 
the stretched length increases by 70% com- 
pared to its initial state. 

Gao et al. found that the modified P(VDF- 
TrFE) with an optimized cross-linking den- 
sity of 1.44% showed the highest degree of 


crystallinity and good elasticity. In addition, 
cross-linking mainly occurs in the amor- 
phous region of the semi-crystalline poly- 
mer because of the immiscibility of PVDF 
and PEG diamine, which largely preserves 
the B phase in the crystalline region and 
likely accounts for the observed ferroelec- 
tricity. Gao et al.’s results show that chemi- 
cal modifications can “tune” the plastic 
ferroelectric polymer into an elastomer, 
turning it from an irreversibly into a revers- 
ibly deformable ferroelectric material. 

Further studies should focus on fer- 
roelectric elastomers with a lower T, and 
higher Curie temperature (the tempera- 
ture above which ferroelectricity vanishes) 
because this would result in a wider work- 
ing temperature window. Elastomers with 
high piezoelectricity—that is, with the 
ability to convert mechanical energy into 
electrical energy or vice versa—should also 
be further explored. Improving elasticity 
in P(VDF-TrFE) after partial cross-linking 
sacrifices its piezoelectric coefficient, d,,., 
from around 40 to 11.3 pm/V (13), which 
results in lower efficiency in converting 
the input electrical signal to mechanical 
deformation. Whether other elastic poly- 
mers can be endowed with ferroelectric- 
ity should also be explored. For example, 
the introduction of chirality in elastomers 
increases the probability of adopting fer- 
roelectricity-inducing polar structures 
(14). Therefore, using homochiral building 
units, such as chiral monomers and cross- 
linkers, could allow ferroelectricity to be 
rationally designed in elastomers (14). On 
the basis of their study, Gao et al. have es- 
tablished a new research direction, elastic 
ferroelectrics. 
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SLEEP the water. Captive phocids sleep floatin siete 


Diverse sleep strategies at sea 


Findings in marine mammals and birds provide 
opportunities to explore sleep’s functions 


By Niels C. Rattenborg 


akefulness allows animals to in- 

teract with their environment, 

but in many species, the efficacy 

of wakefulness depends on sleep 

(1), a seemingly dangerous state of 

reduced environmental awareness 
(2). Sleep has been proposed to serve several 
functions for the brain, including repairing 
neuronal damage (3), strengthening or weak- 
ening synapses and associated memories (4), 
and removing toxic metabolic waste (5, 6). 
However, many animals face ecological chal- 
lenges that seem incompatible with sleep. In 
marine mammals and birds—which need to 
sleep, breathe, and avoid predators—diverse 
neurophysiological, respiratory, and behav- 
ioral solutions to these ecological challenges 
have evolved. Notably, electroencephalogram 
(EEG) studies in great frigatebirds (Fregata 
minor) (7), and northern elephant seals 
(Mirounga angustirostris) (8) indicate that 
these species sleep very little for long periods 
at sea. These findings pose a potentially in- 
formative challenge to current theories about 
the restorative function of sleep. 

Sleep in mammals, birds, and perhaps 
other taxonomic groups is composed of two 
main types: slow-wave sleep (SWS) and rapid 
eye movement (REM) sleep (9). SWS is char- 
acterized by large, slow (0.5 to 4.0 Hz) brain 
waves when compared with the small, fast 
waves of wakefulness. EEG activity during 
REM sleep is similar to that of wakefulness, 
but both eyes are closed, awareness is di- 
minished, and aside from occasional twitch- 
ing, muscle tone is reduced. Whereas REM 
sleep occurs simultaneously in both cerebral 
hemispheres, SWS can occur bihemispheri- 
cally (BSWS) or in one hemisphere at a time 
(unihemispherically, USWS). During BSWS, 
both eyes are usually closed, whereas dur- 
ing USWS, only the eye opposite the sleep- 
ing hemisphere is closed. Among mammals, 
USWS has been found in some marine spe- 
cies (10), whereas it occurs to varying degrees 
in several terrestrial and aquatic birds (2, 
7). While sleeping with one eye open, some 
seals and birds also exhibit asymmetric SWS 
(ASWS), an intermediate state on a contin- 
uum between USWS and BSWS. 
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Dolphins and porpoises primarily sleep 
unihemispherically (10). During USWS, bot- 
tlenose dolphin (Tursiops truncatus) mothers 
and their calves, which swim continuously 
during the first month postpartum, keep the 
open eye facing each other to avoid drifting 
apart (see the figure). Constant activity is 
thought to counteract heat loss to the water 
in newborns, which have a high surface area 
to body mass ratio (J0). As well as swimming 
and surfacing to breathe, dolphins in USWS 
can float at the surface or hold their breath 
and rest on the bottom (JO). In addition to 
USWS, behavioral signs of REM sleep, such 
as occasional twitching, have been observed 
in belugas (Delphinapterus leucas) and gray 
whales (Eschrichtius robustus) sleeping on 
the bottom, but unequivocal electrophysio- 
logical evidence of REM sleep is lacking in ce- 
taceans (10). Therefore, cetaceans might have 
little, if any, REM sleep, or it occurs in a man- 
ner that is difficult to detect. Alternatively, 
the conditions in captivity, where these elec- 
trophysiological recordings were performed, 
might cause cetaceans to suppress REM sleep 
(0). In the wild, groups of sperm whales 
(Physeter macrocephalus) periodically hang 
vertically near the surface, a posture attained 
passively (77). Anecdotal evidence suggests 
that hanging whales are less responsive 
to disturbances in their environment, and 
therefore might be asleep (77). However, it is 
unknown whether this behavior is associated 
with REM sleep or some form of SWS. 

Pinnipeds in the families Otariidae (eared 
seals: fur seals and sea lions) and Phocidae 
(earless, or true seals) differ markedly in the 
way they sleep at sea. On land, northern fur 
seals (Callorhinus ursinus) engage in REM 
sleep and all types of SWS, whereas when 
forced to sleep in the water they primarily 
engage in USWS, like dolphins (12). However, 
unlike dolphins, fur seals float on their side 
at the surface with the tail flippers bent out 
of the water and in contact with the upward- 
facing front flipper. The head is also held up 
to allow breathing. To maintain this posture, 
the front flipper in the water paddles slowly, 
while the open eye facing into the water col- 
umn watches for predators. In addition, by 
keeping most of the flippers out of the water 
and exposed to solar radiation, this posture 
might decrease heat loss (J0). 

In contrast to otariids, phocids engage in 
BSWS and REM sleep while on land and in 


the surface, either horizontally or verticauy, 
and while resting on the bottom of a pool 
(10). The EEG of northern elephant seals was 
recorded in the wild with the use of a water- 
proof electrode cap (8). The elephant seals 
slept as previously described for captive pho- 
cids when in shallow water. However, when 
in the deep open ocean, they initiated a dive 
while awake and then slept as they passively 
drifted down through the water column. 
During typical dives, elephant seals engaged 
in BSWS in an upright posture and then REM 
sleep, during which they rolled upside down 
and spiraled down in a tight circle, likely due 
to a loss of muscle tone. Upon awakening, 
the elephant seals actively swam back to the 
surface. These sleep-dives were not usually 
followed by foraging at depth, indicating that 
they are performed exclusively for sleep and 
at the cost of the energy needed to resurface. 
By retreating hundreds of meters below the 
surface to sleep, it has been proposed that ele- 
phant seals reduce the risk of predation from 
sharks and orcas (Orcinus orca), which hunt 
closer to the surface (8). Despite this evasive 
strategy, elephant seals sleep <2 hours/day 
while at sea, compared with 11 hours/day on 
land, where, on the basis of EEG activity, they 
also appear to sleep deeper. 

Marine birds sleeping on the surface are 
also easy prey for aquatic predators. Although 
many marine birds, such as _ albatrosses, 
spend several hours on the surface at night, 
likely sleeping, great frigatebirds do not land 
on the water, despite spending months at 
sea (13). As sleeping on the surface is risky, 
predation pressure might have contributed 
to the evolution of the frigatebirds’ unusual 
flight and sleep strategy. EEG recordings re- 
vealed that great frigatebirds engaged in all 
types of sleep while soaring in spirals on ris- 
ing air currents at night and mostly consisted 
of ASWS and USWS (7). A strong relationship 
between which way a bird turned and which 
hemisphere was sleeping suggested that 
they keep the eye leading into a turn open. 
Because they have no aerial predators, frig- 
atebirds probably do this to avoid mid-flight 
collisions. Moreover, similar to elephant seals 
at sea, frigatebirds sleep very little (<1 hour/ 
day) while on the wing. Once back on land, 
they sleep >12 hours/day, with bouts of SWS 
being longer, deeper, and more symmetric. 

The studies of marine mammals and birds 
both inspire and challenge hypotheses for 
sleep. Because REM sleep is typically associ- 
ated with a complete loss of postural muscle 
tone in mammals, the finding that captive 
newborn bottlenose dolphins swim continu- 
ously during the first month after birth (0) 
suggests that, despite engaging in USWS, 
they do not engage in REM sleep during this 
period. By contrast, newborn altricial mam- 
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Avoiding predation while marine animals sleep 

To avoid predators, bottlenose dolphins and northern fur seals sleep unihemispherically with one eye 
open. Northern elephant seals hold their breath and sleep bihemispherically (both eyes closed) as they 
spiral down to safer depths. Great frigatebirds sleep unihemispherically and asymmetrically, as 

they spiral up on rising air currents. Sperm whales sleep in groups, hanging vertically at shallow depths. 
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mals (born with underdeveloped brains and 
bodies) spend longer in REM sleep than 
when they are adults (9). On the basis of this 
pattern and the wake-like activation of the 
brain and twitching occurring during REM 
sleep, it is thought that REM sleep provides 
the brain with stimulation for its develop- 
ment. The findings in newborn dolphins 
seemingly challenge this view. However, pre- 
cocial mammals (born with well-developed 
brains and bodies), which exhibit adult levels 
of REM sleep shortly after birth, exhibit more 
REM sleep prior to birth (9). Consequently, 
REM sleep might still occur and play a role in 
brain maturation before birth in cetaceans, 
which are highly precocial. 

Why do dolphins and fur seals sleeping in 
the water exhibit little or no REM sleep? It 
has been proposed that REM sleep, which 
normally follows a period of brain cool- 
ing during BSWS, rewarms the brainstem, 
thereby preparing the animal to move quickly 
upon awakening (12). According to this hy- 
pothesis, the maintenance of motor activity 
in dolphins and fur seals during USWS keeps 
the brainstem warm. Therefore, dolphins 
have no need for REM sleep, and fur seals 
only require it while on land when SWS can 
occur bihemispherically. A direct test of this 
hypothesis would be to determine whether 
brainstem temperature is higher during 
USWS than BSWS in fur seals. 

The diverse adaptations that marine 
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mammals and birds use to sleep at sea em- 
phasize its importance. If sleep was simply a 
default strategy used to enforce inactivity at 
times of the day when it would be dangerous 
and energetically unproductive to be awake, 
as has been suggested (14), then dolphins, 
fur seals, and frigatebirds should sleep bi- 
hemispherically. Instead, they open one eye 
and sacrifice sleep in one hemisphere during 
USWS, apparently to mitigate the conflict 
between the simultaneous need for sleep and 
vigilance (2). This compromise suggests that 
SWS is dangerous and performs important 
functions for the brain. Indeed, the discov- 
ery of USWS in dolphins nearly 50 years ago 
laid the conceptual groundwork for theories 
that SWS restores neuronal wear and tear 
resulting from localized cortical use during 
wakefulness (10). 

The discovery that elephant seals and 
frigatebirds sleep little while at sea poses a 
potential challenge to restorative theories 
for the function of sleep. By sleeping little, 
these animals have less time for restoration 
and should accumulate a greater need for re- 
covery from the wear and tear of prolonged 
wakefulness. In this respect, studies of short- 
sleeping animals provide an opportunity to 
test restorative theories proposing, for exam- 
ple, that sleep repairs neuronal DNA double- 
strand breaks (3), prevents the excessive accu- 
mulation and strengthening of synapses (4), 
and removes toxic proteins accumulated dur- 


ing wakefulness, such as amyloid-B8 (AB) (5), 
which is implicated in neurodegeneration. If 
such theories are correct, short-sleeping ani- 
mals might trade off the ecological benefits 
of prolonged wakefulness in the short term 
for potential costs that affect fitness later in 
life. For example, when compared with long- 
sleeping animals, habitually short-sleeping 
animals might show signs of neurodegenera- 
tion earlier in life. However, it is also possible 
that AB accumulation during prolonged peri- 
ods with minimal sleep is cleared during ex- 
tended sleep once on land. Other restorative 
processes for the brain might be similarly 
postponed until the animals return to land 
and sleep. 

Historically, it was assumed that sleep 
functions cannot be performed during wake- 
fulness. However, it has been hypothesized 
that some sleep functions can be performed 
during wakefulness, albeit less efficiently and 
at a higher energetic cost (15). As such, this 
strategy would require increased access to 
food and physiological adaptations to extract 
this energy and direct it toward performing 
sleep functions while awake. In this manner, 
animals could minimize the accumulation of 
a sleep deficit and thereby the brain’s need 
to initiate sleep—the cause of many lapses 
in waking cognitive performance induced by 
sleep loss in humans, such as falling asleep 
while driving (7). Furthermore, if sleep func- 
tions can be fulfilled during wakefulness, 
there should be no long-term costs of the 
short-sleep strategy other than the increased 
demand for energy. A link between short 
sleep, increased energy intake, and increased 
neuronal mitochondria size would suggest an 
increase in neuronal energy demands. The 
expression during wakefulness of genes nor- 
mally expressed during sleep, and therefore 
implicated in its functions, would suggest 
that neuronal energy is being allocated to 
performing sleep functions during wakeful- 
ness. Future studies inspired by the diverse 
sleep strategies of marine mammals and 
birds will likely lead to a deeper understand- 
ing of sleep’s functions. & 
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Methods for assessing social and cultural losses 


Compensation for damages can and should address social and cultural impacts 


By Robin Gregory'2°, Philip Halteman’, 
Nicole Kaechele*+, Terre Satterfield? 


ecent decades have seen a sharp rise 
in public discontent, activism, and 
court cases regarding how social and 
cultural consequences of past events 
(e.g., the Deepwater Horizon oil spill 
in the United States) or on-going 
and proposed developments (e.g., the Trans 
Mountain oil pipeline expansion in western 
Canada) are assessed. Such assessments 
of terrestrial or marine impacts influence 
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both mitigation actions (to reduce adverse 
impacts) and compensation payments (to 
address remaining losses). Our collabora- 
tive experiences over several decades with 
Indigenous and non-Indigenous partners, 
community members, and decision-mak- 
ers in the United States and Canada have 
made clear the need to broaden the ana- 
lyst’s toolkit and, in turn, the information 
that reaches decision-makers. This requires 
assessment methods that link principles to 
practice and can aid community and state 
decision-makers in determining the full 


range of impacts, including not only eco- 
nomic and environmental effects but also 
the social, cultural, and psychological im- 
pacts associated with an action. 

There is a commonly held public percep- 
tion, supported by results of a national sur- 
vey, that decisions made by public officials 
about large projects often fail to adequately 
account for many impacts that lack mon- 
etary measures (J). We give particular at- 
tention to the distinction between tangible 
impacts such as changes in jobs, revenues, 
or environmental emissions, which can be 
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Wet’suwet’en Hereditary Chiefs who oppose the 
Coastal GasLink pipeline take part in a rally 
in Smithers, British Columbia, Canada, in 2020. 


enumerated and monetized, and intangi- 
ble impacts to cultural practices and sites, 
social relations, or mental health, which, 
despite their importance, are not typically 
measured in monetary terms. 

The goal of monetary compensation is 
to provide a “fair” level of redress (i.e., to 
“make as whole as possible” the adverse ef- 
fects of an action) consistent with the se- 
verity, magnitude, and duration of losses 
(2). When land has been developed in ir- 
reversible ways and thus full restitution is 
not possible, this goal is only aspirational 
or symbolic: Monetary payments alone can- 
not fully offset losses experienced by an 
affected community over many decades. 
Nevertheless, compensation can bring ad- 
ditional funds to a community as well as 
provide public acknowledgment of certain 
rights, experienced damages, and the trans- 
gressor’s fault (3). 

Though the issues and approaches we 
describe are applicable in many contexts, 
we illustrate them below with a focus on 
Indigenous communities, for whom the 
neglect of social and cultural losses in as- 
sessments of compensation is of particular 
concern. This neglect also violates the 2007 
United Nations Declaration on the Rights 
of Indigenous Peoples (UNDRIP), which 
affirmed the preexisting inherent rights 
of the more than 476 million Indigenous 
people around the world (4), including 
rights to compensation for loss or degrada- 
tion of lands, territories, and resources. Yet 
fulfilling this right is problematic, in part 
because of systemic racism and colonial- 
ism and in part because of shortcomings 
in methods used to identify, evaluate, and 
seek redress for damages. These method- 
ological failures exist despite the respon- 
sibility of assessments to be responsive to 
the welfare of all citizens and acknowledg- 
ments by federal or regional governments 
of the rights held by, and fiduciary duties 
owed to, Indigenous peoples. 

Although impact assessment methods 
have become more inclusive over recent 
decades (5), many of the analytic advances 
have not yet been brought into quantitative 
assessments of payments for social, cultural, 
governance, and other intangible losses 
(6), thereby biasing recommendations for 
mitigation, compensation, and decision- 


'Decision Research, Eugene, OR, USA. @Choiceworks Ltd., 
Halfmoon Bay, BC, Canada. “Institute for Resources, 
Environment and Sustainability, University of British Columbia, 
Vancouver, BC, Canada. ‘Compass Resource Management, 
Vancouver, BC, Canada. 5RIO Project Management, Vancouver, 
BC, Canada. Email: robin.gregory@ires.ubc.ca 


SCIENCE science.org 


making. We collaborated with social science 
colleagues and both leaders and members 
of many Indigenous communities (7) to de- 
velop a mixed-method approach for calcu- 
lating compensation payments for damages 
resulting from initiatives such as resource 
developments (e.g., open-pit mines), expro- 
priation of lands (e.g., for industrial uses), 
or environmental accidents (e.g., offshore 
oil spills). 

This approach uses techniques based in 
economic theory to quantify net losses over 
time that can sensibly be estimated in dol- 
lar terms. But where those techniques fall 
short—because the impacts in question af- 
fect intangible values typically not thought 
about in terms of a monetary exchange—we 
use methods based in multi-attribute utility 
theory and decision analysis (8), Indigenous 
methodologies (9), and behavioral decision 
research (JO) to estimate relative dollar 
amounts for net losses to intangible val- 
ues. This combination of methods at least 
partially overcomes the ethical and practi- 
cal limits of the usual one-size-fits-all ap- 
proach that accommodates only like-for- 
like (or, at best, like-for-proxy) comparisons 
and assessments. By contrast, we propose 
a mixed-method approach for assessing 
social and cultural losses that acknowl- 
edges and preserves the distinctiveness of 
tangible and intangible values and enables 
fair, robust, and transparent like-for-unlike 
comparisons when required by a specific le- 
gal or other context. 


METHODOLOGICAL REQUIREMENTS 

Every Indigenous community has its own 
system of governance and legal orders, and 
every classification of impacts will differ 
with respect to its nature, history, magni- 
tude, severity, and time period. That said, 
our experience suggests that the accuracy 
and defensibility of compensation efforts 
that seek to recognize and help restore 
experienced intangible as well as tangible 
losses depends on seven key principles (17) 
(see the box). 


Identify multiple dimensions of impact 

A defensible valuation of net impacts over 
time (gains minus losses) must recognize 
and formally incorporate the multiple di- 
mensions of valued qualities of life that 
have been affected, including not only eco- 
nomic, physical health, and environmental 
considerations but also impacts to cultural, 
nonphysical health, social, and governance 
systems. Implementing this usually re- 
quires considerable time—dozens of in- 
terviews, multiple trips to sites, frequent 
check-ins with community leadership, and 
extensive reviews of ethnographic and oral- 
historical records—to gain a comprehensive 


understanding of the nature, magnitude, 
and duration of losses experienced over the 
period in question, which often spans sev- 
eral decades. 

Helpful work has been done in related 
compensation contexts (e.g., the 1986 Exxon 
Valdez and 2010 Deepwater Horizon oil 
spills) (12). However, in most cases, the iden- 
tification of social and cultural values either 
has focused on individual losses related 
to economic and environmental services, 
thereby missing many place-based commu- 
nity relationships and governance systems, 
or was reported as part of a stand-alone 
report whose findings were not integrated 
into the summary quantitative analyses of 
impacts considered by decision-makers. 

In our work with Indigenous Nations, we 
find that the different categories of loss are 
often interwoven, reflecting the classifica- 
tions consistent with Indigenous knowl- 
edge with respect to their land- or marine- 
based economies. Although clarifying such 
interconnected impacts can be challenging, 
some representation of discrete categories 
(with community definitions and magni- 
tudes specific to each context) can aid in 
their understanding and analysis while 
also serving to educate and communicate 
to non-Indigenous persons the nature and 
extent of experienced losses. Examples of 
such multiple dimensions of impact include 
livelihood (hunting, fishing, trapping, and 
harvesting of plants for household use and 
nonmarket trade), cash economy (economic 
opportunities based in forestry, mining, 
fishing, trapping, or place-based tourism), 
social relations (relationships with family, 
the community, neighbors, and nonhuman 
kin), and cultural knowledge (use of lan- 
guage, ceremonial practices, or knowledge 
transfer as foundations of cultural identity). 


Recognize place-based relationships 
Members of many Indigenous Nations em- 
phasize relationships to places and people, 
including members living outside their Na- 
tion’s traditional territory. Lands and wa- 
ters are typically viewed as living sacred 
entities, and the well-being of individuals 
and the community rests on a mandate to 
protect ecosystem health and relations (13). 
Adverse impacts that occur without the 
community’s participation or free, prior, 
and informed consent on this deep prior- 
ity of place and stewardship can introduce 
enormous distress, diminishment of iden- 
tity, loss of cultural practices and knowl- 
edge, or other negative effects that may 
persist over decades (14). 

The strength and centrality of place- 
based considerations is noted in many 
Indigenous critiques of land management 
(15) and is also manifest across members of 
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non-Indigenous societies, including 
small-scale fishers, foragers, farm- 
ers, and recreationists. Although 
losses to these values are rarely in- 
cluded as part of monetary assess- 
ments of compensation, there are 
some promising examples, such as a 
recent project evaluation in British 
Columbia that considered the need 
for mitigation to protect specific 
stands of trees because the stories 
and teachings associated with them, 
which are basic to cultural continu- 
ity, cannot be told unless Elders and 
others are at the site where specific 
events took place. 


Respect community practices 

Compensation assessments typi- 
cally assume and aggregate the per- 
spectives of individuals and their 
personal values. However, many 
changes in cultural and social prac- 
tices experienced by an Indigenous 
community are more accurately 
valued from a collective perspective 
because stewardship is community- 
based, with resources shared and 
traded rather than transferred be- 
tween individuals through market 
exchanges. For example, families 
often engage in traditional hunting, 
fishing, or harvest activities based 
on reciprocal family or clan rela- 
tions that do not involve cash pay- 
ments. These systems of seasonal 
rounds and trade often involve 
established seasonal movements; 
Cree and Dene Elders, for example, 
describe groups of families going 
to different camps over the course 
of summer and fall. Losing this ac- 
cess can result in negative impacts 
to physical and mental health, in- 
tergenerational knowledge trans- 
mission, and social relations within 


Methodological requirements 

Seven key principles can promote the accuracy and 
defensibility of efforts to assess both tangible and 
intangible losses. 


Identify multiple dimensions of impact 

Though types of loss are often interwoven, 
representation of the full range of losses as discrete 
categories is required to aid in the understanding, 
analysis, and communication of impacts. 


Recognize place-based relationships 

Damages to lands and waters, often viewed as 

living sacred entities key to community relationships 
and well-being, can introduce enormous distress 
and disruption. 


Respect community practices 

Many changes in cultural and social practices are 
more accurately valued from a collective perspective 
because resources are shared and traded among 
community members. 


Communicate uncertainties in compensation estimates 
Uncertainties around data, calculations, and 
judgments, including proportional attribution of 
losses to different factors, should be made explicit 
and ideally include a range of estimates. 


Consult multiple factual sources 

Information from different sources should be 
collected, evaluated, and subjected to the same 
questions about the confidence of an expert, the 
basis for their judgments, and transparency in stating 
the limits of their knowledge. 


Account for shifts in impact severity over time 
Assessments need to reflect that some losses in 

the past may still be responsible for impacts in the 
present, whereas others may have been ameliorated 
by recovery, mitigation, or adaptation. 


Clarify relative importance of impacts 

Analysts can work with community members 

to clarify (and potentially weight) the relative 
importance of each impact category, yielding ratio- 
like valuations for intangible impact categories. 


and among families—valuations that are 
typically omitted from quantitative assess- 
ments because their measurement in dollar 
terms is considered problematic. 


Communicate uncertainties in 

compensation estimates 

Monetary compensation for past losses is 
based on both quantitative calculations, 
which in turn reflect the quality of best 
available data, and judgments based on 
logic and precedent. Both elements lead 
to uncertainties in payment amounts that 
need to be made explicit by documenting 
key sources of uncertainty and providing 
summary low, high, and best-estimate re- 
sults that delineate a reasonable range of 
compensation outcomes. 
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Data quality is a common contributor to 
uncertainty because key sources of informa- 
tion about the social, cultural, health, and 
well-being consequences of impacts will be 
either controversial (with different studies 
showing different results) or missing. For 
example, this is often the case with respect 
to nonphysical health information; statis- 
tics on hospital visits or fatalities related to 
an accident or imposed project may easily 
be available but not information on stress, 
mental health, or relational effects. 

In addition, other aspects of society and 
the environment are likely to have also 
changed during the period over which im- 
pacts are assessed. This requires that judg- 
ments be made by analysts and their logic 
clearly stated—after speaking with commu- 


nity members and gathering infor- 
mation from external sources—with 
respect to the proportion of losses 
for each impact category eligible for 
compensation. For example, fish- 
ing harvests may decrease after a 
marine oil spill, but other factors, 
such as temperature changes in the 
estuary owing to climate change or 
overfishing by commercial fleets, 
may account for a proportion of the 
loss over time and thereby reduce 
estimates of compensation relating 
to the spill. 


Consult multiple factual sources 
Different sources of information 
may provide substantially different 
factual results, and findings from 
field or modeling efforts of non- 
Indigenous scientists may vary or 
be at odds with the observations 
of Indigenous leaders (15). In such 
cases, dialogue and further inves- 
tigation are essential, and often 
involve asking the same validation 
and reliability questions of the var- 
ious sources. Decision-makers may 
be placed in the position of having 
to decide what information and 
which sources they trust, choices 
that, in our experience, should re- 
flect equally the confidence of a 
factual expert (either Indigenous 
or non-Indigenous), the basis on 
which their judgment is made, and 
their transparency in stating the 
limits of their knowledge. 
Clarifying sources of informa- 
tion for the categories of loss also 
helps to streamline the collection 
of factual information by focus- 
ing efforts on changes identified 
as both important and linked to 
the development or event in ques- 
tion. Although redress is typically 
specific to a single cause, many Indigenous 
communities have been subjected to ongo- 
ing damages from several discrete but re- 
lated causes whose effects are cumulative. 
Information relating to this legacy needs 
to be acknowledged as part of compensa- 
tion negotiations because it implies that 
even relatively short-term losses related to 
a development could result in substantial 
harm to the extent that the community has 
already suffered from long-term exposure to 
adverse impacts that jeopardized practices 
essential to its survival. 


Account for shifts in impact 

severity over time 

Quantitative calculations need to consider 
that some losses experienced in the past— 
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particularly acts of violence—may still 
be responsible for considerable impacts 
in the present, whereas other losses may 
have been ameliorated either by partial re- 
covery and mitigation of the impact or by 
adaptation over time on the part of com- 
munity members. The well-documented 
resilience of Indigenous communities im- 
plies that some degree of recovery may 
have occurred over the time period under 
consideration, at least with respect to some 
impacts. Analyses of changes over time 
can be used to adjust annual or decadal 
calculations of present-value monetary 
payments (e.g., adjusted by the consumer 
price index) to enhance the precision 
and defensibility of monetary compensa- 
tion estimates. 


Assess relative importance of impacts 
Absolute dollar amounts for tangible cat- 
egories of loss can often be calculated using 
conventional economic approaches (e.g., re- 
placement costs, adjustments for inflation). 
Intangible loss categories, however, involve 
nonmonetary descriptions and measures. 
Analysts can work with community mem- 
bers to clarify the relative importance of 
each impact category, yielding distinct 
ratio-like equivalents for the intangible 
impact categories without requiring their 
translation into dollar terms. If required by 
legislation or legal precedent, these weights 
are used to assign each intangible impact 
category a monetary value in relation to the 
tangible, dollar-based categories. 

Weighting of impact categories—a stan- 
dard practice in decision analysis—can be 
done in several ways (8). For example, as- 
sume a community has identified multiple 
impact categories and that the present 
value of category A is estimated to be $20 
million. Any impact category rated twice as 
important as category A will be considered 
to have a dollar-equivalent value of $40 
million, whereas an impact category half as 
important will be assigned a value of $10 
million. Adoption of a linear value model 
means that the importance weight on each 
category is used as a multiplier, where the 
sum across all categories thus equals the to- 
tal value of losses eligible for compensation. 

Because the process is likely to be unfa- 
miliar to participants and may be emotion- 
ally or cognitively challenging, it is helpful 
to discuss, reconcile, and potentially revise 
the relative weighting results with par- 
ticipants so that findings are accurate and 
stable. We have found that communities of- 
ten prefer impact weightings to occur with 
a small representative group, for example, 
hereditary and clan Chiefs, with the addi- 
tion of Elders and younger members where 
this is considered to be appropriate. 


SCIENCE science.org 


POLICY IMPLICATIONS 

Insights from social and natural scientists 
(1) and Indigenous-led critiques of assess- 
ment practices (3) are beginning to change 
the scope and focus of efforts to determine 
fair levels of compensation for past and on- 
going losses. Governments now recognize 
that historical precedents and previous 
judgments may not reflect present societal 
standards of fairness and justice and may 
be incompatible with contemporary Indige- 
nous rights (e.g., as designated in UNDRIP). 
Yet across multiple contexts of loss, ranging 
from local (e.g., the traditional lands of a 
single community) to global (e.g., compen- 
sating for climate change impacts as part 
of the Green Climate Fund), the challenge 
is for policy aspirations to be linked to and 
supported by the world of compensation 
methods, which continues to emphasize 
tangible and economic net losses while 
minimizing or ignoring the importance of 
many less-tangible social, cultural, non- 
physical health, and governance impacts. 

The need to include intangible impacts 
as part of monetary compensation is par- 
ticularly urgent with respect to Indigenous 
peoples for two reasons. First, the lack of 
precedence for the use of methods to de- 
fensibly calculate relative dollar values of 
social and cultural losses means that al- 
though policy recommendations are now 
being made that acknowledge the impor- 
tance of intangible impacts, the reality is 
that payment amounts for many important 
losses are considered to be incalculable. As 
a result, compensation is often partial and 
biased, with the types and magnitude of 
payments for losses included in agreements 
more often reflecting the relative negotiat- 
ing strengths of the different parties rather 
than a comprehensive identification of 
losses and a consistent or principled logic. 
Second, when compared with populations 
that are reliant on conventional market- 
based transactions, intangible social and 
cultural losses more profoundly influence 
the ability of Indigenous communities to 
sustain their cultural identity and liveli- 
hood. The failure to adopt methods that fa- 
cilitate inclusion of the full range and mag- 
nitude of impacts as part of compensation 
negotiations and court settlements is thus 
more damaging and the absence of proce- 
dural fairness more egregious. 

We recognize that determination of com- 
pensation awards, whether to Indigenous or 
non-Indigenous communities, involves and 
reflects many factors: societal values, con- 
stitutional rights, legal precedents, political 
objectives, and both the directness and con- 
sistency of the event and damage linkage. But 
the methodological approaches discussed 
here show that much can be done to bring 


less-tangible social and cultural losses into 
the formal compensation assessment frame- 
work by combining insights from economics, 
decision analysis, Indigenous methodologies, 
ecology, and anthropology. This across-disci- 
plines task is challenging, and the adoption 
of social and cultural impact assessment 
methods that more closely reflect the spirit 
of current policies holds broader societal and 
political implications with regard to society’s 
responsibility to compensate communities 
for experienced past (and, in many cases, 
continuing) damages. Negotiations, politics, 
and power will always play a role. But pub- 
lic policies regarding compensation will be 
more defensible and their implications more 
widely acceptable to the extent that impact 
assessments rely on and reflect the results of 
sound and logically consistent analytic meth- 
ods that address the full range of tangible 
and intangible losses. 
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The new cell theory 


A developmental biologist urges a return 
to cell-centric models of life 


By Rick Horwitz 


e live in an era that embraces a 
gene-centric view of life, one in 
which we aspire to understand 
cells, organisms, and pathology by 
analyzing genomes, mutations, and 
patterns of gene expression. This 
mindset is a product of a plethora of broadly 
accessible tools and databases for the genera- 
tion and analysis of genome sequences, cellu- 
lar profiles of gene expression, and methods 
to identify and mutate genes. However, de- 
spite its successes, for many, the genomic era 
has overpromised and underdelivered. Asso- 
ciating a gene or mutation with an abnormal 
behavior, for example, often reveals neither 
how the gene dictates the behavior nor the 
function of the gene in normal behavior. 
Furthermore, many behaviors and diseases 
arise as a result of the contributions of large 
numbers of genes, leaving researchers with 
the challenge of ascertaining the role of each 
gene and how genes work together to create 
particular phenotypes. 
The Master Builder, by developmental 
biologist Alfonzo Martinez Arias, questions 
this gene-centric view and proposes an al- 
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ternative, cell-centric take. Martinez Arias’s 
perspective is that genes themselves do not 
determine cellular morphology, organiza- 
tion, or function. An organism, he maintains, 
is best viewed as a collection of cells that use 
genes to produce tools that determine the 
cellular behaviors that generate a function- 
ing organism. The repertoire of 
expressed genes limits cellular 


options by determining what ek 
THE® 
MASTER ~ 


tools and materials are avail- 
able rather than serving as the 
“blueprint” for an organism. A 
compelling argument in favor of 
this perspective can be found in 
embryonic development, in which 
genomically identical cells do dif- 


BUILDER 


The Master Builder 


including studies of the human embryo (shown). 


for example, includes quoted passages from 
Hooke’s original publication, making the 
reader feel like they are there by the poly- 
math’s side as he makes his observations. 

The Master Builder brings readers up to 
date with a discussion of the recent progress 
in creating organoids and human embryos 
from stem cells and meditations on the 
ethical issues that surround such endeavors. 
Martinez Arias is not afraid to take on thorny 
ethical issues, offering insights and clarity. In 
chapter 7, for example, he considers the HeLa 
cell, the first immortalized human cell line 
derived without consent from cervical cancer 
cells taken from 31-year-old Henrietta Lacks 
in 1951. He argues that the genomes of mod- 
ern HeLa cell lines are so dramatically dif- 
ferent from the original cells, containing, for 
example, anywhere between 35 and 45 pairs 
of chromosomes (compared to most people’s 
23 pairs), that it raises the question of when 
we ought to consider such cells part of a dis- 
tinct individual or even “human” at all. 

Is a cell-centric view of biology the “right” 
perspective? It depends on what you are 
studying and the questions you are asking. 
As Martinez Arias points out, the cell-centric 
perspective is useful in developmental biol- 
ogy, in which concepts such as positional in- 
formation posit a central role for the cell. 

Another framework that has proven fruit- 
ful, however, is the physical-chemical per- 
spective, which views cells and organisms 
as dynamic molecular systems subject to the 
laws of physics and the principles of chem- 
istry. Furthermore, viewing an 
organism as a system of organs 
is sometimes more useful than 
viewing it through the lens of its 
component cells, genes, or mol- 
ecules. Each of these ways of en- 
visioning complex organisms is a 
vital thread that advances our un- 
derstanding of biology, and these 
threads are stronger when spun 


ferent things depending on their — us ie Arias together to form a yarn than when 
location and environment. haart they stand alone. 
2023, 352 pp. 


Although provocative, this shift 
away from thinking that once we understand 
genes, we will understand cells and the or- 
ganisms they comprise is timely and needed. 
It also marks a return to the primacy that the 
cell enjoyed when cell theory first emerged 
in the 19th century, before natural selection 
and the discovery of genes as its mechanistic 
driver captured scientists’ attention. 

Highlights of the book include the many 
fascinating historical and evolutionary per- 
spectives presented and Martinez Arias’s 
discussion of key experiments. His descrip- 


tion of Robert Hooke’s discovery of the cell, 


Overall, I greatly enjoyed read- 
ing this book, which accessibly illuminates 
the secrets of cells, from their complex work- 
ings to their amazing interactions, and offers 
insights into how they might have evolved. 
The only shortcomings were Martinez Arias’s 
frequent reminders about the limits of a 
gene-centric view of biology, which are 
sometimes a distraction from the otherwise 
lucid discussions in which they are em- 
bedded. The book would be a complemen- 
tary companion to Siddhartha Mukherjee’s 
recent book, The Song of the Cell. 

10.1126/science.adj0271 
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David Bohm’s unfinished revolution 


An anniversary offers scientists an opportunity to revisit the work of an audacious physicist 


By Alex Gomez-Marin 


qually at home among scientists and 
mystics, the American-British physi- 
cist David Bohm (1917-1992) was 
considered a “spiritual son” by Albert 
Einstein and a “science guru” by the 
Dalai Lama. Bohm’s career was filled 
with extraordinary collaborations and un- 
expected turns: He conducted his graduate 
studies under the supervision of Robert 
Oppenheimer; joined the faculty at Prince- 
ton University, where he worked closely 
with Einstein; and was subjected to a fed- 
eral investigation for his Communist affili- 
ations. He subsequently relocated to Brazil 
and later Israel before finally joining the 
faculty at Birkbeck College in London. 

As a preeminent theoretical physicist 
of the 20th century, Bohm was arguably 
the first to offer a sustainable alternative 
to the Copenhagen interpretation, the rul- 
ing ontological-epistemic omelet that to 
this day remains hard to unscramble. He 
also made innovative contributions to rela- 
tivity theory and plasma physics, and his 
reformulation of the Einstein-Podolsky- 
Rosen paradox inspired John Bell to craft 
his famous inequalities, the experimental 
validation of which was awarded the Nobel 
Prize in Physics in 2022. With 
his student Yakir Aharonov, 
Bohm discovered the quantum 
effect that bears their names, 
and with Basil Hiley, he further 
developed his initial “hidden 
variables” theory into an “onto- 
logical interpretation” of quan- 
tum mechanics and introduced 
the “quantum potential” as the 
underlying guide of the behavior 
of particles. 

Bohm didn’t stop there. He 
felt that the study of matter and mind 
went hand in hand. With neuroscientist 
Karl Pribram, he entertained the “holo- 
nomic brain” theory, an approach to hu- 
man consciousness that extends beyond 
the dominant tenets of mechanistic reduc- 
tionism. “Even the electron is informed 
with a certain level of mind,” he claimed. 


The reviewer is at the Instituto de Neurociencias, 
Consejo Superior de Investigaciones Cientificas— 
Universidad Miguel Hernandez de Elche, Alicante, Spain. 
Email: agomezmarin@gmail.com. 


SCIENCE science.org 


The Essential 
David Bohm 
Lee Nichol, ed. 
Routledge, 2003, 368 pp. 


He explored the nature of thought in a 
decades-long collaboration with Indian 
philosopher Jiddu Krishnamurti, exam- 
ining together how thought creates most 
(if not all) of the problems that it seeks 
to solve. This later gave rise to “Bohm 
dialogue,” a discussion practice that was 
intended to unveil our attachment to pre- 
suppositions and from whence creativity 
can emerge. 

This year marks the 20th anniversary 
of the publication of The Essential David 
Bohm, a volume conscientiously edited 
by Lee Nichol that presents a broad, non- 
technical overview of Bohm’s scholarship 
and includes a selection of the physicist’s 
original works, including unpublished 
seminars, personal letters, and transcribed 
dialogues. The anniversary offers scientists 
a timely opportunity to revisit the ideas of 
this rebellious thinker. 

Nichol structures the book in three 
main parts, each centered around the no- 
tion of order: “Universal Orders” draws 
from Bohm’s work on the physics of the na- 
ture of the Universe, perception, and con- 
sciousness; “Individual Orders” deals with 
the ego, the self-world image, and the value 
of the individual; and “Collective Orders” 
delves into social dialogue as a generative 
order. Rather than pursuing a 
“theory of everything,” Bohm 
sought a new “order” that would 
transcend the deep problems 
at the foundations of physics. 
Borrowing the image of an ink 
drop in a rotating glycerin cyl- 
inder, he conceived of one of his 
signature ideas: manifest reality, 
or what he called “the explicate 
order,’ as unfolding from (and 
enfolding into) what he called 
“the implicate order.” 

A second idea that Nichol brings to the 
fore is Bohm’s notion of “undivided whole- 
ness,” whereby the subject becomes an ac- 
tive participant of the object under inves- 
tigation rather than a detached observer. 
Often considered a vague concept, Bohm 
progressively elaborated his understand- 
ing of wholeness as innately dynamic, alive, 
and open-ended throughout his career. 

Nichol’s synthesis dispels the peren- 
nial claim that Bohm was trying to draft 
physics into some kind of quasiclassical 
Newtonianism. For decades, Bohm was 


A student of Oppenheimer, colleague of Einstein, 
and adviser to the Dali Lama, David Bohm pursued 
provocative ideas that, at times, defied disciplinarity. 


alone in keeping the Copenhagen door 
from closing, insisting that other views 
were possible and viable. The point, he 
maintained, was not whose interpretation 
was correct, but rather to avoid entrenched 
scientific dogma. 

An exemplar of free scientific inquiry in 
the face of dogmatic ridicule and margin- 
alization, Bohm was adept at identifying 
conceptual treasures hidden in plain sight. 
His revolution remains unfinished, but its 
seeds are starting to bloom in fundamental 
physics, consciousness research, and social 
studies. The Essential David Bohm offers 
readers an opportunity to think anew about 
the ideas that captured the imagination of 
the late physicist. 
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The demand for fried golden cicada, a local delicacy in China, has contributed to cicada population decline. 


Edited by Jennifer Sills 


Silent summers: 
The decline of cicadas 


The song of the cicadas, the acoustic 
backdrop of summers in China since the 
Shang Dynasty (circa 1600 to 1045 BCE) 
(1), has begun to fade. For example, the 
wild cicada population has declined by 
approximately 80% in Ningling, Henan 
Province, in the past decade (2). Today’s 
quiet summers highlight the need for 
urgent conservation action. 

Approximately 220 cicada species have 
been identified in China (7). Males ser- 
enade females during summer evenings 
by vibrating specialized abdominal organs 
(3). The cicada nymphs feed on vascular 
plant tissue, especially roots belowground 
(4). Although they sometimes harm host 
plants, cicadas perform important ecosys- 
tem roles, such as cycling nutrients and 
enhancing soil aeration (5, 6). They also 
serve as a vital food source for animals 
such as moles and birds, and their decom- 
posing bodies enrich the soil (7). 

Rapid urbanization, overharvesting, and 
climate change threaten cicada popula- 
tions. Concrete and compacted soil make it 
difficult for nymphs to surface (8). Habitat 
fragmentation and changes to plant com- 
munities caused by anthropogenic activi- 
ties affect their ability to roost, forage, 
mate, and call (9). Soil pollution leaves 
cicadas vulnerable to fungal and bacterial 
diseases (10), and human noise interferes 
with mating calls (7). To collect the golden 
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cicada (Cryptotympana atrata), a delicacy 
in northern China, people apply glue to 
tree trunks, with devastating impacts on 
cicadas and other wild insect populations 
(11). Flooding, exacerbated by climate 
change (72), inundates the nymphs’ under- 
ground chambers and damages emerging 
adults (5). For example, cicada numbers 
substantially declined in the wake of the 
2021 Henan floods (2). 

Addressing the plight of China’s cicadas 
requires several crucial steps. Unchecked 
urban sprawl and noise pollution must be 
limited. Sustainable farming practices are 
necessary to reduce soil contamination 
and habitat fragmentation. Regulations 
on cicada trapping and consumption 
should be established and strictly enforced. 
Global efforts, including citizen science, 
are needed to monitor and mitigate the 
impacts of climate change on these insects 
and their ecosystems. 
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Bills undermine Brazil’s 
environmental goals 


After 4 years of regression in Brazil’s envi- 
ronmental policies, the administration of 
President Luiz Inacio Lula da Silva has been 
trying to resume an environmental agenda 
(1). However, the Brazilian Chamber of 
Deputies—led by Arthur Lira, a supporter 
of former President Jair Bolsonaro (2)—has 
recently approved a provisional measure 
(MP) and a bill, MP1154,/2023 and PL490, 
that would hinder Brazil’s ability to control 
deforestation and threats to Indigenous 
lands. If these bills are approved by the 
Senate and signed into law by the president, 
global trade partners should reconsider their 
import agreements with Brazil. 

MP1154/2023 and PL490 won congres- 
sional approval through the support of 
representatives of large landholders, known 
as ruralists. If implemented, MP1154/2023 
would remove power from federal agencies, 
limiting their ability to enforce environ- 
mental protection. For example, depart- 
ments under the authority of the Ministry 
of the Environment, including the Rural 
Environmental Registry (CAR), would be 
moved to ministries that are controlled by 
ruralists (3, 4). The Ministry of Indigenous 
Peoples would be stripped of its power 
to designate new Indigenous lands (3, 

4). In addition, the Brazilian Institute of 

the Environment and Renewable Natural 
Resources (IBAMA) would no longer have 
veto power over the licensing of large 
infrastructure projects (5). If PL490 takes 
effect, it will extinguish all Indigenous lands 
demarcated since 1988 (6), likely leading to 
increased mining and agriculture in the vul- 
nerable Amazon region (7). 

The changes proposed in these bills would 
have immediate and devastating effects. 
Ruralists are pressing the construction of 
Highway BR-319, which would connect 
Porto Velho in Brazil’s notorious “arc of 
deforestation” to Manaus, in relatively intact 
central Amazonia (8, 9). Paving has not 
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yet begun, but the federal government has 
prioritized the project without consulting 
Indigenous peoples (9), making the envi- 
ronmental authority of independent federal 
agencies even more important. Currently, 
the Ministry of the Environment controls 
such environmental inspection mechanisms 
through the CAR (3, 4, 8), and IBAMA can 
veto the Highway BR-319 project if the 
Ministry determines that it poses unac- 
ceptable environmental harm. However, 
MP1154/2023 would place the decision in the 
hands of ruralists, and PL490 would remove 
Indigenous peoples’ legal mechanism to jus- 
tify land protection. 

Because the ruralists of the Brazilian 
Congress are financed by Brazilian agribusi- 
ness (10), they prioritize economic gains 
over environmental concerns. MP1154/2023 
and P1490 are likely to be approved by the 
Brazilian Senate, whose president is also 
aligned with the ruralist caucus (2), and 
President Lula may sign them, given that 
they have the support of his voter base (77). 
However, the Federal Supreme Court of 
Brazil has the power to safeguard the rights 
of thousands of Indigenous peoples in Brazil 
and reduce the pressure on their lands (12) 
by rejecting the bills’ legitimacy. 

If the bills become law and the Supreme 
Court allows them to stand, Brazilian com- 
modities will be inextricably linked to 
deforestation and the violation of the rights 
of Indigenous peoples. Global climate goals 
and public health would also be at risk. 
Countries that import commodities from 
Brazil, such as meat, soybeans, corn, ores, 
biofuels, oil, and minerals such as gold and 
iron, must review their trade policies con- 
sidering the environmental damage Brazil 
could perpetrate by producing them. 
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Brazilian Atlantic 
rainforest under attack 


Brazil has committed to ambitious goals, 
including the reforestation of 12 million 
hectares (7). However, a recent proposal 
by Brazil’s Congress (2, 3) puts at risk the 
Atlantic Forest, one of the world’s most 
threatened biomes (4). The Provisional 
Measure (MP) 1150/2022 aims to change 
the Atlantic Forest Law (11.428/2006) and 
the Forest Code (12.651/2012) by loosening 
restrictions on removing native vegeta- 
tion. After Brazil’s Chamber of Deputies 
and Senate approved the proposal in May, 
President Lula rejected it, but the Congress 
could overturn the presidential veto. 
National and international pressure are 
essential to persuade Brazil’s Congress to 
comply with the presidential veto and pre- 
vent further attacks to the Atlantic Forest. 
MP 1150/2022 would allow the removal 
of native vegetation for construction in 
linear infrastructure projects, such as gas 
pipelines, power transmission lines, and 
supply systems, without the environmental 
impact assessments and ecological com- 
pensation that are currently required by 
law. The pending bill would also eliminate 
some legal requirements for the protection 
and management of fauna that are cap- 
tured, collected, and transported during the 
deforestation stages of linear infrastruc- 
ture projects. Unlike the current Atlantic 
Forest Law, which establishes a percentage 
of forested area that must be preserved 
(5), the pending bill would allow reduced 
green areas in urban environments. The 
bill allows restoration in permanent pres- 
ervation areas to serve as compensation for 
urban forest losses, but permanent pres- 
ervation areas should already be forested 
according to the Forest Code, making them 
ill-suited for this purpose. 


MP 1150/2022 would also undermine 
financial incentives to mitigate climate 
change. Currently, financial institutions can 
require that businesses comply with the 
Atlantic Forest Law and the Forest Code to 
qualify for subsidies. For example, Rural 
Environmental Registry (CAR) tracks irreg- 
ularly deforested rural areas, and property 
owners must restore the forest area within 
a certain amount of time. The proposed bill 
would postpone the deadline for environ- 
mental restitution (through restoration or 
paying a fine), hampering financing agen- 
cies’ ability to make restoration compliance 
a credit requirement (6, 7). This change 
could have troubling effects given that 
about 50% of the native vegetation of the 
Atlantic Forest is on private land (8). 

The Atlantic Rainforest is a global bio- 
diversity hotspot (9), with myriad endemic 
(4, 8) and threatened species (J0) and 
many species that are important to diverse 
traditional and Indigenous peoples (8). 
Tropical forests are responsible for 36% 
of carbon storage in the world (8). Along 
with four other countries, Brazil has about 
37% of the world’s potential carbon stock 
(11). Permitting deforestation in the biome 
could have local and global consequences, 
including species loss (3), potential dis- 
ease emergence (4), threats to biocultural 
diversity, a decrease in available ecosystem 
services (8) such as carbon capture and 
storage, and intensified climate change. 
For Brazil to assume a leadership role on 
environmental action, Brazil’s Congress 
must reject this proposal and propose laws 
grounded in scientific knowledge and envi- 
ronmental protection. 
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Comment on “Food wanting is mediated by 
transient activation of dopaminergic signaling in 
the honey bee brain” 


Andrew Barron et al. 

Huang et al. make an exciting claim about a 
human-like dopamine-regulated neuromod- 
ulatory mechanism underlying food-seeking 
behavior in honey bees. Their claim is based 
on experiments designed to measure brain 
biogenic amine levels and manipulate recep- 
tor activity. We have concerns that need to 
be addressed before broad acceptance of 
their results and the interpretation provided. 
Full text: dx.doi.org/10.1126/science.adg3916 


Response to Comment on “Food wanting is 
mediated by transient activation of dopaminergic 
signaling in the honey bee brain” 


Songkun Su and Martin Giurfa 

Barron et al. criticize the work of Huang et 
al., putting the accent on the quantification 
of dopamine levels via high-performance 
liquid chromatography, yet also including 
data interpretation through alternative 
hypotheses aimed at invalidating the origi- 
nal ones proposed by Huang et a/. We thank 
the authors of this Technical Comment, 
which allows us to clarify technical aspects 
of our work that may have been unclear, and 
for promoting discussion around the conclu- 
sions of our work. We provide answers to the 
points raised in their comment. 

Full text: dx.doi.org/10.1126/science.adg6207 
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Paralysis under pressure 


Treating cardiovascular paralysis with epidural 


electrical neuromodulation 


By Aaron Phillips 


he Edwin Smith papyrus, an ancient 
Egyptian medical text, described spi- 
nal cord injury (SCI) as an untreatable 
condition (7). More than 3000 years 
later, an amazing breadth and depth 
of neuroscience research has not pro- 
foundly improved this prognosis. Once an 
SCI occurs, little is available to medically 
treat the motor, sensory, and autonomic is- 
sues that permeate into all aspects of life (2). 
One devastating consequence of SCI is car- 


diovascular paralysis, leading to insidiously 
low blood pressure (BP) known as hypoten- 
sion. Low BP after SCI can threaten survival, 
impair neurological recovery, and increase 
cardiovascular disease risk (2-4). Autonomic 
issues such as cardiovascular paralysis are 
often a greater patient health priority for 
people with SCI than walking again (5). 
Hypotension after SCI is due to the in- 
terruption of descending pathways from 
the brainstem to sympathetic circuits be- 
low the level of injury. The disruption of 
these descending pathways disables a key 


Mechanism-guided development and deployment of a 
neuroprosthetic baroreflex for spinal cord injury 


(A) Negative-pressure chamber to simulate an orthostatic challenge in a rodent with a wireless telemetry 
system implanted to record hemodynamics and sympathetic nerve activity 24/7. (B) Closed-loop stabilization 
of blood pressure using a proportional integral (Pl) controller that adjusts the amplitude of traveling epidural 
electrical stimulation (EES) waves over the three hemodynamic hotspots to maintain a defined blood pressure 
target. (C) Computational model of the human participant, including predicted spatial configurations to target 
the hemodynamic hotspots. (D) Orthostatic challenge on a tilt table demonstrates pressor response with EES. 
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BP-stabilizing system, termed the arterial 
baroreflex. New therapies for hypoten- 
sion after SCI are needed, as current clini- 
cal management options lack efficacy and 
commonly lead to harmful side effects (2). 
One potential treatment that can facilitate 
volitional movement of the legs and walk- 
ing after SCI is lumbosacral epidural stim- 
ulation of the spinal cord (6). Our group 
made a tantalizing and unexpected obser- 
vation in one patient that epidural electri- 
cal stimulation (EES) of these lumbosacral 
spinal cord segments transiently increases 
BP (7). Lumbosacral segments contain few 
sympathetic preganglionic neurons, which 
control blood vessel tone, and we thought 
it unlikely that this approach could har- 
ness the maximum potential of EES to 
activate sympathetic circuits and elevate 
BP after SCI. We were intrigued to further 
explore whether EES could be optimized to 
treat hypotension after SCI. 

Here, I describe our recent odyssey of 
conceiving, developing, and validating an 
ultrafast, reliable, “neuroprosthetic baro- 
reflex” that targets EES to discrete spinal 
cord circuits to treat hypotension after SCI. 
Throughout this journey, we developed 
cutting-edge tools to reveal the major mech- 
anisms of hypotension after SCI, and the spe- 
cific neurotargets to treat hypotension (8). 

The foundation of this research was laid 
by a new heuristic model of hypotension. 
We developed a servo-controlled hind-limb 
negative-pressure chamber that induced in 
rodents the hypotension experienced by hu- 
mans with SCI (see the figure, panels A and 
B) (8). We used rostroventral lateral medulla 
injections of Cre-dependent anterograde 
tracers into tyrosine hydroxylase-Cre rats to 
establish that hypotension after SCI is due 
to a near-complete disruption of descend- 
ing sympatho-excitatory pathways across the 
site of injury. After a severe spinal cord con- 
tusion survival surgery, 24/7 monitoring data 
of beat-by-beat BP and heart rate were col- 
lected over several weeks. Using this model, 
we demonstrated that hypotension rapidly 
develops immediately after injury and per- 
sists into the chronic state months later (8). 

By using segment-by-segment EES, we 
found that the three most caudal spinal seg- 
ments of the thoracic cord were the most 
powerful areas to elicit a robust increase 
in BP, termed a “pressor response” (see the 
figure, panel D). Retrograde pseudorabies 
tracing from the primary hemodynamic- 
related peripheral ganglia revealed that 
these thoracic spinal segments also housed 
the greatest density of sympathetic circuits 
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for BP control. We termed these segments 
the “hemodynamic hotspots” as they were 
both functionally and structurally en- 
riched for BP control (8). These key results 
provided a mechanistic framework for de- 
veloping the hardware required to target 
the hemodynamic hotspots effectively, and 
therefore enable the rapid control of BP in 
a closed loop (8). 

To stimulate the hemodynamic hotspots 
with electrical current, we designed and 
built an implantable electrode array that 
obeyed the geospatial confines of the tho- 
racic cord, with electrode placements to spe- 
cifically activate the caudal thoracic spinal 
segments (8). To stabilize BP, this hardware 
was paired with a custom closed-loop system 
to proportionally control stimulation ampli- 
tude depending on the deviation between a 
preprogrammed set-point for normal BP and 
real-time beat-by-beat BP. This system was 
termed the “neuroprosthetic baroreflex.” 
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The acute phase of an SCI is the most life- 
threatening period and a time when the spi- 
nal cord can partially recover if BP is well 
managed. However, it is challenging for cli- 
nicians to stabilize BP during this phase of 
injury (3, 4). We next tested whether the neu- 
roprosthetic baroreflex could stabilize BP in 
the acute phase of SCI using clinical-grade 
neurotechnologies. We ensured that the key 
elements of neurointensive care were rep- 
licated in a nonhuman primate model (i.e., 
propofol-based anesthesia, an arterial pres- 
sure line, and artificial ventilation). The SCI 
led to neurogenic hypotension mimicking 
that occurring after SCI in humans (8). 

To modulate hemodynamic hotspots in 
the larger nonhuman primates, we scaled 
up our implantable electrode array, which 


was connected to a clinical-grade implant- 
able pulse generator with wireless com- 
munication and custom-made software for 
real-time closed-loop control. Even just a 
few hours after the SCI, the neuroprosthetic 
baroreflex instantly normalized BP and was 
effective continuously without the need for 
adjustment. The neuroprosthetic barore- 
flex was robust enough to stabilize BP in 
the face of neurogenic shock and near-fatal 
hemodynamic collapse (8). 

To bring this therapy to the community, 
we needed to test our approach in a human 
as a proof of concept. In an individual with 
chronic complete cervical SCI resulting in 
refractory and debilitating hypotension, a 
clinically approved electrode array was sur- 
gically positioned over the hemodynamic 
hotspots. We predicted optimal electrode 
configurations to target the hemodynamic 
hotspots using computer simulations (see 
the figure, panel C). For this individual, 
EES led to powerful pressor responses and 
increased sympathetic nerve activity, and 
the system stabilized BP in real time dur- 
ing orthostatic challenges (see the figure, 
panel D) (8). 

Long-term use of this therapy allowed the 
individual to cease other medical treatments 
for hypotension, led to a 90% improvement 
in quality of life, and permitted participation 
in upright physical rehabilitation (8, 9). 

Our intellectual property surrounding this 
technology is licensed to publicly traded ON- 
WARD Medical. Together we have initiated 
the next steps in our journey—clinical trials 
to test the viability of this potential thera- 
peutic for medical indication. Our team is 
excited to share the results of our endeavor 
with the community and help shift the an- 
cient Egyptian notion toward one of opti- 
mism for those with SCI. 
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Edited by Michael Funk 


SPECIES COEXISTENCE 


Tropical trees deter similar neighbors 


ropical forests host an unusually high diversity of tree species. 


Strong interactions between individuals are hypothesized to 
create these patterns. A tree is more likely to survive when 
surrounded by different tree species with different resource 
needs, diseases, and herbivores. Kalyuzhny et al. found pat- 
terns consistent with this mechanism in a long-term forest plot in 


Check for 
updates 


Panama. Adult trees in this site are more distant from members of 
their own species than from other species and more distant than 
would be expected by chance or by the limits of seed dispersal. This 
study shows that distances between conspecifics are maintained in 
adult trees, helping to explain the high diversity of tropical forests. 
—BEL Science, adg7021, this issue p. 563 


Interactions between individual trees promote high species diversity in tropical forests. 


ORGANIC CHEMISTRY 
Oxygen-assisted 
triangulations 


Strained three-carbon rings, or 
cyclopropanes, are important 
constituents of a wide variety 

of natural products, pharma- 
ceuticals, and agrochemicals. 
However, most methods for their 
chemical synthesis unfortunately 
still rely on hazardous diazo 
reagents or sensitive doubly halo- 
genated carbon centers. Poudel 
et al. now report a versatile 
cyclopropanation reaction using 
readily available, stable com- 
pounds with methylene centers 
bearing electron-withdrawing 
ketone, ester, or cyano groups. 
Activation of these precursors 


496 


relies on photoreduction of 
oxygen by a sensitizer, with 
catalytic iodine to turn over the 
cycle. —JSY 

Science, adg3209, this issue p. 545 


GEOPHYSICS 
Following fluid injection 


Volcanoes and glaciers have 
seismic tremors that are caused 
by fluid movement or other 
frictional processes. Niyogi et 
al. found that tremors with very 
similar characteristics also 
occurred during a supercritical 
carbon dioxide injection pro- 
gram in Kansas. After carefully 
subtracting background noise, 
the authors found tremors 
located near fault structures in 
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the interior that likely were due 
to frictional slip. This finding 
may provide a way to model and 
track stress distribution in the 
subsurface during the injection 
of carbon dioxide or other fluids. 
—BG 

Science, adh1331, this issue p. 553 


CYTOKINE SIGNALING 
TMDs help ‘yc cytokine 
receptors assemble 


Gamma-chain (yc) cytokine 
receptors play critical roles in 
the development, proliferation, 
and activation of immune cells. 
They are composed of a com- 
mon y-chain that associates 
with interleukin receptor (ILR) 


subunits specific to each cyto- 
kine. Association between these 
subunits is driven by interleukin 
binding to ILR ectodomains 
and subsequent recruitment of 
the yc. Cai et al. used nuclear 
magnetic resonance spectros- 
copy and functional mutagenesis 
of two ILRs to show that the 
transmembrane domains, which 
link ectodomains to intracellular 
signaling domains, also play 
an active role in signaling. The 
authors further show that the 
yc transmembrane domain can 
specifically recognize various ILR 
transmembrane domains from 
the yc receptor family through 
a conserved “knob-into-hole” 
mechanism. —STS 

Science, add1219, this issue p. 569 


science.org SCIENCE 


PHOTO: SEBASTIAN KENNERKNECHT/MINDEN PICTURES 


PHOTO: YULIA REZNIKOV/GETTY IMAGES 


The gut-retina axis 
in glaucoma 


T cells are likely involved in 
glaucoma pathogenesis, but 
how they gain retinal access 
remains elusive. He et al. dis- 
covered an association between 
CD4* T cells that express the 
gut-homing integrin 87 with 
glaucoma disease severity in 
patients. Experiments with 
a glaucoma mouse model 
showed that 87* CD4* T cells 
take a detour to the gut, 
where they undergo genetic 
reprogramming to gain retinal 
access. Blocking mucosal 
addressin cell adhesion mol- 
ecule 1 prevented the retinal 
infiltration of B7* CD4* T cells 
and ameliorated the loss of 
retinal ganglion cells. —DN 
Sci. Transl. Med. (2023) 
10.1126/scitransimed.adg1656 


The inner workings 
of tumors 


One of the challenges in study- 
ing human disease is that 
the same condition can mani- 
fest differently across patients. 
However, patient variation 
can also be a positive thing, 
revealing information about 
the composition of diseased 
tissues and the relationship 
to disease outcome. Bill et 
al. used patient-to-patient 
variations to study how tumor 
microenvironments influence 
the progression of head and 
neck squamous cell carcinoma. 
The authors found that varia- 
tions in macrophage polarity, 
defined by the expression of 
two genes, CXCL9 and SPPI1, 
was a simple but critical feature 
of tumor microenvironments. 
The CXCL9:SPPI1 ratio could 
characterize the abundance of 
antitumor immune cells in can- 
cer, gene expression programs 
in each tumor-infiltrating cell 
type, the regulation of commu- 
nication networks that dictate 
tumor control or progression, 
and the response to immuno- 
therapy. —PNK 

Science, ade2292 this issue p. 515 
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Symbiont blocks parasite 
reproduction 


Malaria mosquitoes can act as 
hosts to several microorgan- 
isms, including commensal 
bacterial species. Huang et 
al. noticed that some labora- 
tory colonies of anopheline 
mosquitoes were incapable of 
transmitting malaria parasites. 
These insects also harbored a 
few cells of a bacterium called 
Delftia tsuruhatensis TC1, which 
produces a toxic alkaloid called 
harmane. Bacteria-produced 
harmane inhibited the develop- 
ment of female Plasmodium 
parasite gametes in the mos- 
quito gut. Harmane was found 
to be acontact poison that 
could also cross the mosquito 
cuticle to kill developing malaria 
parasites. Contained field trials 
in Burkina Faso, coupled with 
modeling studies, showed that 
the bacterium has the potential 
to be deployed in mosquito 
breeding sites as a component 
of malaria control. —CA 

Science, adf8141, this issue p. 533 


Closing in on how 
STING functions 


Stimulator of Interferon 
Genes (STING) is an innate 
immune sensor that activates 
noncanonical autophagy 
and the inflammasome. The 
exact mechanisms involved in 
this process are unclear, but 
proton leakage from organ- 
elles appears to be a common 
feature. Liu et al. analyzed 
STING’s structure, hypothesiz- 
ing that its transmembrane 
domain forms a pore capable 
of proton transport. The 
authors used intracellular pH 
measurements and cell-free 
proteoliposome assays to 
show that STING can transport 
protons across membranes. 
Furthermore, this activity 
and STING’s proton leakage- 
dependent downstream 
functions were inhibited by a 
small molecule that binds at 
the pore. —SMH 

Science, adf8974, this issue p. 508 


CHEMISTRY 


Edited by Caroline Ash 
and Jesse Smith 


Drugs from turpentine 


rom the inception of modern chemistry, many drugs, 

dyes, and fragrances have been derived from coal or oil. 

Although they are a very small fraction of the total car- 

bon footprint, developing sustainable sources of these 

products is important to build up the sustainable chem- 
istry sector. Tibbets et al. developed divergent syntheses of 
two common over-the-counter painkillers using monoter- 
penes that can be sourced from forestry by-products. Several 
intermediates in this process could be developed for use in 
other consumer products. —MAF 


ChemSusChem. (2023) 10.1002/cssc.202300670 


Biorenewable monoterpenes, available in industrial quantities 
from a wide range of plants, have great potential as feedstocks for 


many common pharmaceuticals. 


Seeing Parkinson’s 
disease 


Parkinson's disease is charac- 
terized by the misfolding and 
aggregation of a-synuclein 
protein in the brain. Radioactive 
tracers that bind to pathological 
protein aggregates have enabled 
the study of other neurodegen- 
erative disorders with positron 
emission tomography (PET); 
however, a method of imaging 
a-synuclein aggregates in vivo 
has been lacking. Xiang et al. 
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synthesized a new radioactive 
tracer, ['°F]-FO502B, that binds 
misfolded a-synuclein in vitro 
and enables PET imaging of 
a-synuclein aggregates in the 
brains of nonhuman primates. 
['8F]-FO502B was specific for 
a-synuclein and did not bind 
substantially to aggregates of 
tau or amyloid-a, which are char- 
acteristic of Alzheimer’s disease 
but are also commonly found 
in individuals with Parkinson's 
disease. —SAL 
Cell (2023) 
10.1016/j.cell.2023.06.004 
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HUMAN GENETICS 
Historical ties found in 
long-buried DNA 


Catoctin Furnace operated in 
Maryland in the 18th and 19th 
centuries using both free and 
enslaved African American labor. 
Harney et al. analyzed DNA from 
27 individuals from an African 
American cemetery there that 
was excavated 40 years ago 
during highway construction (see 
the Perspective by Jackson). The 
authors found genetic evidence 
of biological family groups, 
modern-day African populations 
with whom they may have shared 
ancestry, and even possible dis- 
tant relatives in the United States 
through identity-by-descent 
comparisons with consenting 
customers of 23andMe. This 
study demonstrates that when 
studied responsibly with input 
from stakeholders, long-buried 
DNA can be used to uncover 
obfuscated or forgotten histo- 
ries of marginalized individuals. 
—CNS 

Science, ade4995, this issue p. 500; 

see also adj2380, p. 482 


MICROBIOME 
Improving diabetes 
control 


Products from the gut micro- 
biota have been implicated in 
various metabolic disease states 
and drug transformations. K. 
Wang et al. developed a quanti- 
tative experimental workflow to 
identify isozymes from human 
and mouse gut microbiota 

that influence host physiology. 
Among these, the authors identi- 
fied Bacteroides spp. microbial 
dipeptidyl peptidase 4 (DPP4). 
Host DPP4 is an important 
therapeutic target for the man- 
agement of type 2 diabetes. The 
DPP4 produced from gut micro- 
organisms can also degrade the 
host's glucagon like peptide-1 
and contribute to compromising 
glucose homeostasis, but only 

if the gut barrier is made leaky 
by a high-fat diet. Moreover, the 
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authors discovered a specific 
microbial DPP4 inhibitor, 
daurisoline, which could be 
used to support treatment with 
sitagliptin, which tends to show 
variable efficacy. —CA 

Science, add5787, this issue p. 501 


SPATIAL OMICS 
An overview of modern 


omics approaches 


All of biology happens in space. 
In living organisms, cells must 
interact and assemble in three- 
dimensional tissues. The position 
of each cell is just as important 
as its intrinsic nature in deter- 
mining how a tissue functions 
or malfunctions in a disease. 
Recently, many technologies 
have been invented to profile 
cells without removing them from 
their natural context, measur- 
ing gene expression and the 
regulatory landscape of a cell's 
genome alongside its spatial 
location within a tissue. In a 
Review, Bressan et al. describe 
the features of these methods, 
collectively named spatial omics, 
and discuss what is missing for 
them to unlock their full potential. 
—SMH and GKA 

Science, abq4964, this issue p. 499 


MICROBIOTA 
Pancreatic polypeptide YY 
tames Candida 


The mammalian pancreas 

and gut enteroendocrine cells 
secrete a family of peptides, 
including pancreatic polypep- 
tide YY (PYY), that affect gut 
secretion, motility, and appetite. 
Paneth cells in the gut secrete 
several antimicrobial peptides 
into the lumen to regulate the 
microbiota, as well as PYY. 
Noticing that PYY resembles the 
structure of known antimicrobial 
peptides, Pierre et al. investi- 
gated the possibility of another 
function for PYY (see the 
Perspective by Lin and lliev). The 
Paneth cell-derived PYY was 

full length, unlike the endocrine 


form, and although it had some 

antibacterial activity, it was 

most effective at inhibiting the 

ubiquitous gut fungus Candida 

albicans from transforming 

from the commensal yeast form 

into the invasive hyphal form. 

It thus appears that full-length 

PYY plays a fundamental role in 

keeping a fungal member of the 

mouse microbiota in check. —CA 
Science, abq3178, this issue p. 502; 

see also adj1240, p.483 


FERROELECTRICS 
Flexible ferroelectrics 


Ferroelectric materials change 
electrical properties in response 
to an electric field, which is a 
useful property for a range of 
applications. However, even poly- 
mer-based ferroelectrics tend to 
lack flexibility. Gao et al. added 
a small amount of cross linking 
to a ferroelectric poly(vinylidene 
difluoride) terpolymer, which 
dramatically improved its 
elasticity (See the Perspective by 
Zhang and Xiong). The mate- 
rial can be stretched repeatedly 
without degrading its ferro- 
electric properties, making it a 
potentially attractive strategy 
for developing wearable devices 
or for use in other applications. 
—BG 

Science, adh2509, this issue p. 540; 

see also adj2420, p. 484 


ORGANIC CHEMISTRY 
Glimpsing a nitrene 
intermediate 


Nitrenes are highly reactive 
chemical intermediates that are 
widely implicated in the forma- 
tion of carbon-nitrogen bonds. 
Electron-deficient nitrenes, in 
which a nitrogen atom is bonded 
solely to an acyl (RC=O) group, 
have proven especially hard to 
glimpse. Jung et al. report crys- 
tallographic characterization of 
this acyl nitrene motif coordi- 
nated to a rhodium catalyst. The 
authors were able to access this 
by tethering a ligand to a precur- 
sor that expelled carbon dioxide 
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upon photoexcitation, leaving 
the nitrene exposed. A reaction 
with acetone observed in the 
crystal validated the expected 
reactivity pattern. —JSY 

Science, adh8753, this issue p. 525 


FERROELECTRICS 
Aquick switch 


Hafnium oxides are attrac- 
tive as materials for computer 
memories and other applica- 
tions because their ferroelectric 
properties persist even for 
very thin layers. However, the 
relatively large risk of breakdown 
is one of the limiting factors in 
commercialization. Y. Wang et 
al. discovered that adding more 
hafnium and zirconium atoms 
helps to stabilize the ferroelec- 
tric properties. This allows for 
many more cycles before failure 
due to a decrease in the field 
required for polarization switch- 
ing. —BG 

Science, adf6137, this issue p. 558 


AUTOIMMUNITY 
T cell complications 


Immune checkpoint inhibitor 
(ICl)—based therapies have 
transformed cancer treatments 
but have also been associated 
with autoimmune side effects. 
Wang et al. characterized T 
cell-mediated responses in the 
joints of patients with inflamma- 
tory arthritis who had received 
anti-PD-1-based ICI therapy 
(ICl-arthritis). Compared with 
patients with rheumatoid arthri- 
tis or psoriatic arthritis, patients 
with ICl-arthritis had expansion 
of a particular T cell subset in 
their joints that had elevated 
expression of interferon-associ- 
ated transcripts. In vitro studies 
confirmed that type | interferon 
treatment of CD8T cells could 
induce phenotypic changes con- 
sistent with the observed T cell 
subset, which was also detected 
in a cohort of melanoma patients 
undergoing treatment. These 
findings provide evidence that 
ICI therapy can directly influ- 
ence CD8T cell responses in 
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individuals who develop spon- 
taneous |Cl-arthritis, which is 
distinct from other autoimmune 
arthritides. —CNF 
Sci. Immunol. (2023) 
10.1126/sciimmunol.add1591 


CANCER 
Tolerating targeted 


chemotherapeutics 


For patients with MAPK signal- 
ing—driven cancer, subsets of 
tumor cells often persist despite 
treatment with targeted inhibi- 
tors. Hoffman et al. identified a 
common mechanism of tumor 
tolerance to MAPK pathway 
inhibitors. Lung, colon, or skin 
cancer cells that initially went 
dormant under drug treatment 
and then re-entered the cell 
cycle relied on components of 
the Fanconi anemia pathway 
to tolerate the DNA replication 
stress caused by cycling in the 
presence of the drug. Clinical 
samples were consistent with 
the treatment-induced phe- 
nomenon of replication stress 
tolerance in relapsing tumors. 
—LKF 
Sci. Signal. (2023) 
10.1126/scisignal.ade8744 
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SPATIAL OMICS 


The dawn of spatial omics 


Dario Bressan, Giorgia Battistoni, Gregory J. Hannon* 


BACKGROUND: Just as single-cell sequencing 
has revolutionized many fields of biology, spa- 
tial “omics,” in which molecular parameters 
are measured in situ on intact tissue samples, 
is set to empower a new generation of scien- 
tific discoveries. A plethora of new technolo- 
gies now enable spatial profiling of gene and 
protein expression, genetic mutations, epige- 
netic marks, chromatin structure, and genome 
organization. Most of these methods trace back 
to traditional techniques such as immunohisto- 
chemistry and in situ hybridization, or leverage 
the throughput of next-generation sequencing 
by converting spatial coordinates to sequence 
barcodes. Spatial omics technologies operate at 
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vastly different levels of resolution, sensitivity, 
and throughput. There are also considerable 
differences in ease of adoption, compatibility 
with different sample types, commercial avail- 
ability, upfront investment, and cost per sam- 
ple. In such a rich and constantly evolving 
field, choosing the best technology to address 
a specific biological challenge—carefully weight- 
ing the strengths and weaknesses of each—is 
critical. 


ADVANCES: The field of spatial molecular pro- 
filing has come of age over the past few years. 
Current RNA profiling technologies allow the 
spatial measurement of gene expression for 
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Spatial omics methods profile the molecular make-up of tissues, preserving their spatial organiza- 
tion through four main steps. (A) Detection. Oligonucleotide probes or modified antibodies bind to specific 


nucleic acids, proteins, or small molecules. Alternatively, enzymatic processes can detect RNA (reverse 
transcription) or DNA (tagmentation) in an untargeted fashion. Single or multiple types of biomolecules can 
be profiled simultaneously. (B) Identification. Biomolecules can be identified directly with next-generation 
sequencing or mass spectrometry, or indirectly by means of an individual tag on the detection probe that is 
either read at once or sequentially (combinatorial barcode). (©) Measurement. Imaging-based methods 
measure the signal intensity of fluorescent probes or count the number of spots per area (single-molecule 
ocalization methods). Sequencing-based methods quantify the abundance of biomolecules through 
normalized read count or using unique molecular identifiers (UMIs). (D) Localization. Biomolecules are 


assigned to their spatial locations either directly in imaging-based methods, or by means of a DNA barcode 
decoded through sequencing. Technologies differ widely in terms of spatial resolution, sensitivity, and number 


and size of areas that can be profiled. 
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quantify thousands of transcripts, including 
those of very low abundance, whereas un- 
targeted protocols offer transcriptome-wide 
profiling and can even detect gene isoforms 
and mutations, with coverage approaching 
that of disaggregated single-cell methods. 
Multiplexed immunohistochemistry protocols, 
either based on mass spectrometry or fluores- 
cence imaging, can similarly be used to profile 
tens to hundreds of protein markers on a wide 
range of samples. Recent studies have re- 
ported additional approaches that recover 
other types of molecular information, includ- 
ing the spatial profiling of genomic organiza- 
tion for thousands of loci, open chromatin and 
epigenetic marks, and untargeted in situ DNA 
sequencing. Multi-omic technologies captur- 
ing multiple sources of information at once 
are also becoming available. As these methods 
become more reliable and widely adopted, it is 
becoming increasingly clear that spatial omics 
will contribute substantially to the elucidation 
of many biological questions, with promi- 
nent examples in fields ranging from oncol- 
ogy (such as the study of tumor heterogeneity 
and microenvironment) to neuroscience and 
organismal development. 


OUTLOOK: Future development trends in the 
spatial omic area are needed in three main 
areas: (i) multi-omics, which is defined as the 
simultaneous measurement of different param- 
eters (for example, DNA, RNA, and protein); (ii) 
increased access and democratization, with 
technologies becoming more readily availa- 
ble, more reliable, and more robust; and (iii) 
improved analysis frameworks. As technologies 
continue to evolve, it will become necessary to 
dedicate more attention to data analysis and 
experimental design. Spatial omics technolo- 
gies can already generate many terabytes of 
data in a single experiment, creating substan- 
tial challenges in data processing, analysis, and 
visualization. Statistical methods and guide- 
lines for sample replication, study design, and 
batch correction are also lagging behind, al- 
though they're evolving rapidly. Last, as the 
future keeps getting closer, we foresee that 
spatial omics will likely evolve toward three- 
dimensional spatial omics (operating on whole 
organs or even organisms) and spatial-temporal 
omics (performing measurements multiple 
times in vivo), adding further powerful and ex- 
citing tools to biology’s toolbox. 
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*Corresponding author. Email: greg.hannon@cruk.cam.ac.uk 
Cite this article as D. Bressan et al., Science 381, eabq4964 
(2023). DOI: 10.1126/science.abq4964 


S READ THE FULL ARTICLE AT 
https://doi.org/10.1126/science.abq4964 


lof1 


RESEARCH 


SPATIAL OMICS 


The dawn of spatial omics 


Dario Bressan, Giorgia Battistoni, Gregory J. Hannon* 


Spatial omics has been widely heralded as the new frontier in life sciences. This term encompasses a wide 
range of techniques that promise to transform many areas of biology and eventually revolutionize pathology by 
measuring physical tissue structure and molecular characteristics at the same time. Although the field came 
of age in the past 5 years, it still suffers from some growing pains: barriers to entry, robustness, unclear 
best practices for experimental design and analysis, and lack of standardization. In this Review, we present 

a systematic catalog of the different families of spatial omics technologies; highlight their principles, power, and 
limitations; and give some perspective and suggestions on the biggest challenges that lay ahead in this 
incredibly powerful—but still hard to navigate—landscape. 


early all of the seemingly infinite com- 

plexity of biology occurs in three dimen- 

sions. Although we are all familiar with 

studying the spatial architecture of pro- 

teins and macromolecular machines, 
studying the architecture of even simple orga- 
nisms or single tissues requires not only de- 
ciphering the molecular profiles of thousands 
to millions of cells but also understanding how 
their spatial context influences their behavior. 
A recent burst of innovation in making multi- 
plexed spatial molecular measurements is 
opening a new chapter in biological research 
and enabling a renewed embrace of the com- 
plexity of living systems. 


Fig. 1. Spatial proteomics methods based on 
multiplexed antibody detection. Proteomics 
methods can be broadly divided into two types 

of protocols. In one (left), antibodies are all bound 
together and detected either with MS imaging 
(IMC or MIBI) or cyclic fluorescence imaging 
(CODEX). In the other (right), primary antibodies 

are themselves bound and stripped in each cycle 

(4i, CyCIF, or Cell DIVE MACSima). In IMC and MIBI, 
each antibody is bound to a different metal by a 
chelating polymer. During imaging, a laser vaporizes 
the tissue pixel by pixel, and the metals released 
from each spot are quantified with a mass spectrom- 
eter. The distribution of metals can then be used 

to deduce the presence and spatial distribution of 
specific protein antigens in the tissue. The two 
methods differ in how mass measurements are 
performed and in their resolution: ~1 ym for IMC and 
~300 nm for MIBI. In fluorescence-based methods, 
each cycle detects as many markers as there are 
florescence channels available, with complexity scaling 
linearly with the number of cycles. Binding all probes 
together (CODEX) results in faster imaging and 
shorter cycles but requires custom modification of each 


One of the first steps along this journey was 
the emergence of single-cell “omics” technol- 
ogies that operate on disaggregated tissues. 
These methods enabled the discovery of new 
cell types (7), cast new light on organismal de- 
velopment (2), and launched the process of 
creating comprehensive catalogs of human 
and mouse tissues (3). However, biological pro- 
cesses happen in a spatial context, and the 
three-dimensional (3D) arrangement of cells 
in a tissue has a profound effect on their func- 
tions. For example, location constrains cell-cell 
interaction by modulating contact or short- 
range paracrine signaling, which creates spe- 
cialized niches such as those that support many 
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types of tissue stem cells. Spatial gradients of 
both physical and chemical properties also 
contribute to differential regulation of biological 
processes at every scale, but are particularly 
prominent during the development of multicel- 
lular organisms. Regardless of their undisputed 
power, measurements made on disaggregated 
cells or nuclei lack this layer of information. 
The need for such knowledge has driven the 
development of “spatial omics”: methods capa- 
ble of measuring the molecular characteristics 
of cells in their native 3D context. 

Spatial omics technologies mainly measure 
either proteins and their modifications, or 
mRNAs, although spatial genomic and epige- 
nomic profiling is now emerging. Compared 
with their nonspatial counterparts, these meth- 
ods offer “shallower” profiling, often with an in- 
verse relationship between the degree of spatial 
resolution (or profiling depth) and the experi- 
mental time investment. Despite these limita- 
tions, this nascent field has already provided 
substantial and varied insights into several 
fields, including animal development and brain 
structure (4-6) and features of the tumor micro- 
environment (TME) that correlate with patient 
outcome (7, 8). Although we are at the very be- 
ginning of the spatial omics revolution, progress 
is happening at breakneck speed, and it is clear 
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antibody and extensive optimization. On the other hand, binding antibodies in cycles (4i, CyCIF, MACSima, and Cell DIVE) enables the use of “native” antibodies that can be 
obtained commercially but results in slower cycles (up to 1 day per round). Owing to the repeated rounds of microscopy, all cyclic imaging methods are slower than IMC and 
MIBI on a per-sample basis and present more challenges for cycle-to-cycle alignment of images (registration). However, the data-generating capacity is actually higher 
because the slow element is the staining process, which can be performed in parallel on many samples, whereas the imaging itself is much faster than that with MS. 
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Fig. 2. Spatial dissection and selective 
illumination methods. Several different 
methods can be used to restrict profiling 
to a specific section of a sample. Laser 
capture microdissection (LCM) relies on 
the physical isolation of a tissue fragment, 
from either flash-frozen or FFPE-treated 
material (24), followed by bulk measure- 
ments. By contrast, photo-sensitive groups 
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assign an area ID by means of crosslinking or 
hybridization, respectively. 


that these approaches will drive us toward a much 
deeper understanding of biology in context. 


The current generation of spatial molecular 
profiling tools 


The idea of performing spatial molecular mea- 
surements on tissue sections is hardly new. 
Immunohistochemistry (IHC) and in situ hy- 
bridization (ISH) have been in use for more 
than 50 years (9-11). Both ISH and IHC can be 
multiplexed to a degree corresponding to the 
number of dyes that can be differentiated 
during observation. However, a move toward 
“omic” level measurements by simply increas- 
ing the number of dyes is not possible. Spatial 
omics technologies therefore rely on other ap- 
proaches. They may use nonmicroscopic meth- 
ods such as mass spectrometry (MS) to detect 
analytes, perform cycles of staining and dye 
removal to achieve the required complexity, or 
harness the power of high-throughput DNA 
sequencing and barcoding. 


Multiplexed antibody profiling 


Over the past 20 years, MS has already been 
successfully adapted to in situ measurements 
(72). A more recent generation of MS, usually 
referred to as “mass cytometry” (MC), is based 
on detection of antibodies conjugated to iso- 
topically pure lanthanide metals (Fig. 1) (73). 
Each antibody is bound to a different metal by 
a chelating polymer. During imaging, a laser 
vaporizes the tissue pixel by pixel. Because 
lanthanides are essentially absent from bio- 
logical materials and their masses can be dis- 
criminated with great precision, this method 
allows the routine quantification of more than 
40 different species with a very high signal-to- 
noise ratio (SNR). This approach is the basis of 
two spatial imaging technologies known as 
imaging mass cytometry (IMC) (/4) and multi- 
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plexed ion beam imaging (MIBJ) (15). Although 
initially developed to detect proteins and pro- 
tein modifications through antibody binding, 
these methods have been adapted for RNA 
imaging by producing metal-conjugated hybrid- 
ization probes (16). However, these approaches 
are limited by the number of discriminable 
masses and, more importantly, by the availa- 
bility of sufficiently pure metals, a nontrivial 
issue considering the scarcity of some ores. 
Together, these factors constrain the achieva- 
ble complexity to roughly 50 species. 

Despite this limitation, spatial MC methods 
have been extremely successful, in part owing 
to the early commercial availability of instru- 
ments and reagents. Hundreds of studies that 
used the technology have been published on a 
wide variety of subjects (8, 17-22). Immunology 
and cancer biology, in particular, have benefit- 
ted hugely from the ability to perform a de- 
tailed characterization of the TME. An example 
is a recent survey of approximately 500 human 
breast cancer samples, which detected specific 
TME motifs and indicated that spatial features 
correlate strongly with patient outcome (8). 

An alternative method for multiplexed anti- 
body imaging is based on detection of conven- 
tionally labeled fluorescent antibodies over 
repeated imaging cycles (Fig. 1). In some of 
these methods, all antibodies are bound to the 
sample at once, and detection is performed in 
cycles by secondary probes, typically fluores- 
cent oligonucleotides hybridized or ligated to 
a “root” attached to the antibody [codetec- 
tion by indexing (CODEX) (23)]. In others, 
antibodies are themselves bound in cycles and 
either stripped after each round of detection 
(24, 25) or made undetectable by means of 
photobleaching or chemical cleavage [CyCIF 
(26, 27), Leica Cell DIVE (28), and Miltenyi 


bulk measuremenrs 
or dissociated scRNAseq  ¢ 


MACSima (29)]. Each of these strategies have 
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distinct advantages and disadvantages in terms 
of cycling speed and antibody optimization 
(Fig. 1), but overall, they have a greater data- 
generating capacity relative to that of MS. In- 
deed, cyclic imaging methods are likely the 
most viable alternative for projects that in- 
volve large sample collections (30, 31). 


Spatial transcriptomics and genomics 


The ultimate goal of a family of technologies 
known as “spatial transcriptomics” is to measure 
the abundance of every gene and gene isoform, 
at subcellular resolution, in a whole tissue sam- 
ple in three dimensions. At present, no technol- 
ogy is able to reach this target. Instead, each 
protocol makes trade-offs in terms of sensitivity, 
throughput, resolution, or ease of use. Current- 
ly, all technologies work on histological sections 
of various thickness and can be largely consid- 
ered “2D” methods, with some exceptions. 
Spatial genomic profiling is based on four 
main strategies: (i) microdissection methods in 
which specific areas of a tissue are physically 
isolated or otherwise marked so that their DNA 
or RNA can be extracted and processed; (ii) 
combinatorial fluorescence in situ hybridiza- 
tion (FISH) followed by single-molecule imag- 
ing, which allows counting of mRNA molecules 
present in a given area of a tissue; (iii) in situ 
sequencing of mRNA molecules; or (iv) spatial 
barcoding methods in which the DNA and 
mRNAs in a given area are linked to a specific 
predetermined DNA “barcode,” which allows 
them to be computationally mapped back to a 
spatial location after bulk sequencing. These 
approaches are described in more detail below. 


Microdissection and photo-isolation 


Microdissection is perhaps the most obvious 
way to add spatial information to molecu- 
lar profiling (Fig. 2), through purification of 
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Fig. 3. Imaging-based methods. Molecules are detected through hybridization 
and identified according to patterns of fluorescence signal, either driven by 
cooperative hybridization or incorporation of fluorescent nucleotides. In 
osmFISH, transcripts are individually detected in separate cycles of hybridization 
and signal release. MERFISH, seqFISH, and seqFISH+ use combinatorial 

labeling schemes to assign each transcript to a barcode, consisting of a subset of 
hybridization probes that are detected in cycles of imaging and release. Each 
digit is read in a separate cycle, with different colors (seqFISH+) or presence 
and absence of fluorescence (MERFISH) corresponding to different values. This allows 
he detection of up to X” genes (where X is the potential values of each digit and 
nis the number of cycles). Whereas seqFISH probes are directly conjugated to 
fluorophores, MERFISH and seqFISH+ use “saddle probles” with a region 
complementary to a set of fluorescently labeled “readout” probes, then are 
hybridized and read cyclically. Each spot corresponds to a RNA molecule, and the 


+ 


in situ sequencing based 


sequence of fluorescence signals identifies it. In “targeted” ISS methods, multiple 
transcripts of interest are bound by a library of detection probes that circularize 
through split ligation. ISS, BaristaSeq, and ExSeq use padlock probes to either 
form an exact junction or leave a gap that is filled by polymerase before ligation. 
In STARmap, a pair of oligonucleotides [snail (specific amplification of nucleic acids 
via intramolecular ligation element) probes] need to bind in close proximity to 
enable circularization. In “untargeted” methods, RNA is retrotranscribed from a 
poly(T) oligonucleotide, and the cDNA is then circularized through intramolecular 
single-strand ligation. The circularized molecules are amplified by means of 

RCA, producing rolonies. In all cases, Sequencing is performed on the rolonies by 
using either sequencing-by-ligation or sequencing-by-synthesis technologies. 
Targeted methods read a barcode within the probe itsef or in the gap-filled region 
(allowing detection of mutations). Untargeted methods sequence the cDNA 


itself, which can be purified and sequenced again in bulk (ExSeq) 


biomolecules from a specific area by means 
of physical separation. Laser-capture microdis- 
section (LCM) (32) is routinely used to iso- 
late small areas (tens to hundreds of cells) for 
RNA-sequencing (RNA-seq) and can reach 
single-cell resolution (33). 

Recent studies have used light, rather than 
physical dissection, to spatially select a region 
to profile. Light can be used to trigger reverse 
transcription (RT) [transcriptome in vivo anal- 
ysis (TIVA) tag or photo-isolation chemistry 
(PIC) (34, 35)], cleave a reporter [Nanostring 
GeoMx (36)], attach a purification label (Syncell 
MicroScoop), or remove a blocking group that 
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prevents ligation of a sequencing adapter [photo- 
selective sequencing (PSS)] (37). Light can also 
trigger attachment of a DNA barcode, or bar- 
code combination, to specific areas either by 
driving their cross-linking or hybridization to 
the tissue (38, 39). The barcode is then used to 
reassign reads to these areas. 

All microdissection technologies share some 
key advantages and drawbacks: They offer 
very deep profiling, almost to bulk-sequencing 
levels, and allow the area of interest to be cus- 
tomized from a single cell to a whole region. 
However, they have a comparatively low through- 
put (usually below hundreds of locations) be- 
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cause each selected area has to be collected 
and processed individually. Barcoding meth- 
ods partially bypass this limitation but are still 
largely limited to hundreds of areas at most. 


Multiplexed ISH 


Cyclic hybridization-based methods (Fig. 3) 
are the descendants of single-molecule FISH 
(smFISH), a set of technologies that resolve 
individual mRNA molecules in tissues as sub- 
diffraction fluorescent spots. To enable detec- 
tion, these protocols require multiple probes 
to bind to the same mRNA molecule, either 
because multiple fluorophores bound to a small 
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area are needed in order to produce a signal 
(40, 41) or because the concurrent binding of 
two probes within a few nucleotides of dis- 
tance triggers a signal amplification process 
(42-44). smFISH is often considered the “gold 
standard” in RNA quantification methods be- 
cause it can detect very low-abundance tran- 
scripts down to a single copy per cell, and 
therefore the spatial profiling technologies de- 
rived from it tend to have excellent sensitivity. 

All cyclic FISH methods share some com- 
mon elements: (i) a way to associate combi- 
natorial barcodes to transcripts through the 
hybridization of short DNA probes; (ii) a meth- 
od to read the barcodes, element-by-element, 
with cycles of fluorescence imaging; (iii) a 
strategy to produce sufficient signal on each 
molecule to enable detection, either through 
sheer probe number or by signal amplifica- 
tion; and (iv) a way to reset the signal after 
each cycle. In sequential FISH (seqFISH), 
fluorophore-labeled FISH probes are directly hy- 
bridized to cellular mRNAs and removed after 
each cycle through deoxyribonuclease (DNase) 
digestion, leaving the mRNA in place (45). In 
multiplexed error robust FISH (MERFISH), 
the probes include both a mRNA-binding re- 
gion and a series of “reporter” regions that 
correspond to the barcode elements, which are 
detected with secondary hybridization rounds 
(46). This makes detection effectively indepen- 
dent of the original RNA molecule and makes 
the protocol more resistant to ribonuclease 
(RNase) contamination, while also decreasing 
imaging time and making imaging of hun- 
dreds to thousands of targets practical. An 
alternative version of seqFISH, seqFISH+, fol- 
lows the same approach (47). Another ele- 
ment introduced by MERFISH and adopted in 
seqFISH+ is making the combinatorial barcode 
“error robust” by using encoding strategies 
taken from information theory, thus mitigat- 
ing the impact of hybridization failures or 
nonspecific binding, which can be quite sub- 
stantial owing to the cyclical nature of imaging. 

In the past few years, both MERFISH and 
seqFISH+ have been extended to the whole- 
transcriptome level (47, 48), adapted to simul- 
taneous detection of mRNA and protein by 
oligo-conjugated antibodies, and enhanced by 
the addition of signal amplification strategies 
and tissue embedding and clearing (44, 49, 50). 
MERFISH has also been adapted to expansion 
microscopy, a technique for physically expand- 
ing samples to increase the size of biological 
structures. Combining MERFISH with expan- 
sion microscopy (57) permits the identification 
of transcripts that would otherwise be too close 
to each other to be recognized as individual 
molecules. MERFISH and seqFISH+ are ef- 
fectively very similar in terms of sensitivity, 
throughput, and pros and cons, with a few 
technical differences in protocol and details of 
the barcoding scheme. Both have been used 
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successfully to investigate spatial gene expres- 
sion in different organs and tissues, although 
their technical complexity has laborious im- 
plementation that has limited their broad 
adoption, although recently MERFISH has 
been made commercially available and sim- 
pler to use. 

Although MERFISH and seqFISH are the 
two main hybridization-based in situ transcrip- 
tomic methods currently available, a few other 
technologies share some of their key elements. 
Ouroboros smFISH (osmFISH) is a cyclical 
smFISH method that lacks a barcoding scheme, 
trading multiplexing capacity (only a few tran- 
scripts per cycle are detected) for a simpler 
protocol not affected by transcript abundance 
or density (52). Saber-FISH and Clamp-FISH 
are based on different signal amplification 
methods and offer substantially improved 
SNR compared with that of “first-generation” 
protocols, but they have not been shown to 
enable detections of more than 100 transcripts 
(53-55). SCRINSHOT (56), a similar method, is 
specifically designed to detect approximately 
30 transcripts on challenging formalin-fixed 
paraffin-embedded (FFPE) tissue. A more re- 
cent protocol called enhanced electric FISH 
(EEL-FISH) uses electrophoresis to drive cel- 
lular mRNAs to the surface of a conductive 
glass slide. The tissue is then removed, and the 
mRNAs are detected with multiplexed FISH 
(57). Removal of tissue results in a substantial 
increase in SNR and in much higher speed. 
Last, commercial options are becoming avail- 
able for several of these technologies. Many of 
these solutions offer profiling of thousands of 
genes and, optionally, of tens of proteins sim- 
ultaneously (58). 


In situ sequencing 


Close relatives of cyclic-FISH protocols are in 
situ sequencing technologies (Fig. 3). The prem- 
ise of this family of methods is to execute 
high-throughput sequencing chemistry on clus- 
ters generated in situ within a histological sec- 
tion. The sequencing technology used most 
often is sequencing by ligation (rather than 
sequencing by synthesis, which dominates tra- 
ditional next-generation sequencing). A common 
step to all of these protocols is the production 
of a DNA “nanoball” (rolony) through rolling 
circle amplification (RCA) of a circular tem- 
plate produced from the detected transcripts. 
A separate rolony is produced for each mRNA 
molecule, allowing transcript quantification. 
The maximum read length is ~30 nucleotides 
because long reads are much more difficult to 
perform in situ than on a flow cell owing to 
the many variables at play. 

One of the first examples of these methods, 
ISS, is based on padlock DNA probes hybrid- 
ized to mRNA (59-61). The padlock probe 
either produces an exact juxtaposition of the 
probe ends, which are joined by ligation, or 
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leave a gap that is filled by a polymerase and 
then ligated. In the former variant, sequenc- 
ing is performed on a reporter barcode se- 
quence within the noncomplementary region 
of the probe, whereas in the latter, any se- 
quence present in the gap is read. This allows 
both gene expression profiling (in which hy- 
bridization events to a preset list of genes are 
counted) or untargeted de novo sequencing of 
single-nucleotide polymorphisms or mutations 
(still limited to a specific gene list). The method 
evolved over time and was improved by re- 
placing sequencing by ligation with a new 
form of sequencing by hybridization (in effect 
a form of specialized seqFISH-MERFISH per- 
formed on barcoded rolonies), with a resulting 
increase in SNR (62). 

A contemporaneous technology called fluo- 
rescence in situ sequencing (FISSEQ) (63) 
generates the RCA amplicon through the cir- 
cularization of a cDNA molecule, itself produced 
by means of RT of mRNAs by using an oligo(dT) 
primer. The technology allows true “untargeted” 
sequencing in space, although only for very 
short, 3’ terminal reads and at the expense of a 
very low sensitivity (well below 1% of total 
cellular transcripts) owing to the low effi- 
ciency of in situ RT and cDNA circularization. 

ISS and FISSEQ rapidly gave rise to a variety 
of derivative methods with higher efficiency 
and improved features. In STARmap (spatially 
resolved transcript amplicon readout map- 
ping) (64), two probes are used rather than a 
single padlock probe, one acting as a splint for 
the other upon binding in close proximity. 
This increases sensitivity, although still not in 
the range of the hybridization-based methods. 
The sequencing chemistry is also altered and 
improved compared with traditional sequenc- 
ing by ligation. BaristaSeq (65) increases ef- 
ficiency by optimizing the gap-filling padlock 
probe method and using Illumina sequenc- 
ing by synthesis for detection [resolution is 
unclear because (65) generally applies the tech- 
nique to the sequencing of a single barcode per 
cell]. More recently, both the targeted padlock- 
based ISS method and the untargeted FISSEQ 
method have been combined with expansion 
microscopy chemistry (66), with substantial 
improvements in both efficiency and SNR 
owing to the decrowding of rolony signals and 
sample clearing. One of the key innovations of 
ExSeq is the combination of untargeted, short- 
read ISS with regular “bulk” sequencing per- 
formed on the same sample after extraction 
of the cDNA contained in the tissue. The short 
sequences serve as distinct “tags” to match 
each “bulk” read to a specific location, effec- 
tively resulting in full-length, long-read ISS 
that is sufficient to map splice isoforms with 
subcellular resolution in complex tissues such 
as a mouse brain. 

FISH-based and in situ sequencing-based 
methods share one common element: detection 
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is ultimately performed by means of micros- 
copy imaging of the tissue at high resolution, 
and gene expression is measured by effectively 
counting individual molecules of mRNA. Al- 
though imaging is the most direct way to obtain 
spatial information, single-molecule microscopy 
is very challenging and requires extremely pre- 
cise instruments and procedures. This technical 
barrier is compounded by the need for multiple 
imaging cycles, combined with extensive sam- 
ple manipulation through fluid exchanges, en- 
zymatic reactions, or other procedures. Not 
surprisingly, image co-registration and feature 
extraction are among the biggest challenges 
in the analysis of all imaging-based spatial 
omics datasets (also discussed in Big data to 
useful data—the challenge of analysis). Single- 
molecule microscopy can also be slow because 
the need for high magnification and resolu- 
tion produces a very small field of view, re- 
quiring extensive tiling and image stitching 
(in itself not a trivial procedure) to acquire any 
useful sample size. Molecular crowding within 
the cell also imposes an upper boundary to the 
number of molecules that can be resolved 
given the diffraction limit. Detection sensitiv- 
ity is therefore biased by cell size, with larger 
cells allowing deeper profiling. 


Spatial barcoding of bulk sequencing reads 


One might argue that the best available tool to 
produce high-quality transcriptome quantifi- 
cation data with very high speed and through- 
put is already available in the form of regular 
next-generation sequencing, if only spatial po- 
sition could be encoded in sequence space. 
This is exactly the key insight behind ST (Fig. 
4) (67). In ST, an ordered array of oligonucleo- 
tides is deposited on a glass slide by using 
microarray printing technologies. Each oligo- 
nucleotide includes a handle compatible with 
a downstream sequencing reaction, a barcode 
specific for each spatial position, a random 
UMI (unique molecule identifier) used to cor- 
rect polymerase chain reaction (PCR) ampli- 
fication bias, and a polythymidine [poly(T)] 
sequence used to capture polyadenylated [poly 
(A*)] messengers. Each array spot features a 
different spatial barcode. A thin histological 
section is then placed on the array, permeabil- 
ized to allow the cellular RNA to diffuse to the 
barcodeed oligos, and reverse-transcribed in situ 
to produce spatially indexed cDNAs. The latter 
are then amplified, adapter-ligated to produce 
a library, and sequenced by using standard 
next-generation sequencing. Some key advan- 
tages are the ability to produce long sequenc- 
ing reads (although still biased for the 3’ end 
of transcripts), lack of reliance on complex im- 
aging instruments, high speed (the time re- 
quired for processing is effectively independent 
of sample size), and potential for paralleliza- 
tion. These features greatly facilitated com- 
mercial adoption of the technology. However, 
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there are some notable drawbacks, including 
low resolution ( currently 50 to 100 um), a high 
cost per sample, and low efficiency of RNA cap- 
ture (Table 1). 

The resolution of ST was greatly improved 
a few years later by two methods, Slide-seq 
(68) and high-density ST (HDST) (69), which 
attach the barcoded poly(T) primers to nano- 
beads (which can have a much smaller diam- 
eter) rather than directly to the glass surface. 
The beads are then packed on an adhesive 
slide (Slide-seq) or on a microwell-patterned 
slide (HDST), reaching a much higher spatial 
density (10 um for Slide-seq and 2 um for 
HDST). Because the packing is random, the 
barcodes are first decoded spatially with a 
round of ISS, and the slides are subsequently 
used for a standard ST protocol. Compared 
with original ST, Slide-seq and HDST enable 
profiling at a resolution comparable with the 
size of single cells, but this comes at the cost of 
amore complicated protocol and reagent set 
up and a lower efficiency of RNA capture. A 
second version of Slide-seq pushed capture 
efficiency to the same neighborhood of dis- 
aggregated single-cell RNA-sequencing (scRNA- 
seq) (70). A more recent method from the same 
authors, Slide-tags, uses the same approach to 
add spatial addresses to single cells that are 
then processed by using regular disaggregated 
pipelines, enabling multi-omic spatial profil- 
ing with established techniques (77). 

Another method, “Seq-Scope,” provides a 
substantial advance in the field by “double- 
dipping” into one of the most widely used 
next-generation sequencing platforms. In this 
approach, a modified sequencing flow cell is 
used to generate randomly distributed DNA 
barcodes by means of bridged PCR amplifica- 
tion. The clusters are then mapped with se- 
quencing by synthesis, enzymatically processed, 
and extended to remove the distal sequencing 
handle and install a UMI and a poly(T) se- 
quence (72). This effectively produces a very 
dense barcoded slide with a different spatial 
barcode every ~500 nm, for which the spatial 
coordinates of each spot are already known. 
The slide is then used for a slightly modified 
ST protocol, and a second round of sequencing 
is used to determine RNA abundance. Two 
other approaches, Stereo-seq (spatial enhanced 
resolution omics sequencing) and PIXEL-seq 
(polony-indexed library-sequencing) use a car- 
pet of DNA “nanoballs” growing on a slide to 
achieve similar submicrometer resolution. Both 
methods have a relatively high capture effici- 
ency, on par or slightly above scRNA-seq (73, 74). 

A coarser way to produce spatial barcodes, 
which is nevertheless very effective, uses a 
series of microfluidic channels pressed on 
top of a tissue sample to create a physical bar- 
rier. In DBIT-seq (deterministic barcoding in 
tissue for spatial omics sequencing), barcoded 
oligonucleotides flowing through each channel 
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are ligated to target biological molecules, after 
which the microfluidic chip is rotated 90°, and 
the barcoding is performed a second time. This 
procedure produces a grid, in which each inter- 
section has a different index. This method has 
been used to profile RNA (75), proteins (76), 
and chromatin features (77, 78), either alone or 
in combination (75, 79). Resolution varies be- 
tween 10 and 50 um depending on the chip 
size, with higher resolution being more tech- 
nically challenging. Capture efficiency is quite 
high, likely because of the missing requirement 
of diffuse RNAs from the tissue onto a barcoded 
support (Table 1). 

Spatial barcoding techniques combined 
with bulk sequencing have been very success- 
ful. The availability of commercial options, 
nonreliance of high-precision instruments, 
and compatibility with existent workflows 
has lowered the entry barrier for many groups. 
Data analysis is also comparably simple be- 
cause all processing happens at the level of 
sequence data, an area of expertise more wide- 
spread in biological laboratories than image 
analysis. However, the low resolution and ran- 
dom positioning of the spatial barcodes, with 
respect to the sample, means that the infor- 
mation can sometimes average several cells of 
different types, confounding results. In spite of 
this, ST and its derived techniques have been 
successfully applied to many studies—in par- 
ticular, in the fields of neuroscience and cancer 
biology (80-82)—and improved methods will 
presumably push adoption spatial profiling 
techniques even further. 


The balancing act of spatial profiling 


In an ideal world, a spatial profiling method— 
whether it targets RNA, proteins, or DNA— 
would offer a near-complete and unbiased 
image of the entire molecular content of a cell, 
at subcellular resolution and in a short period 
of time, reliably and cheaply enough to be per- 
formed on many samples, including archival 
samples. All current technologies, however, en- 
tail certain compromises, and the choice of ap- 
proach therefore depends on the specific needs 
of each particular study. Resolution, through- 
put, ease of implementation, robustness, and 
cost represent key variables in the spatial pro- 
filing balancing act (Table 1). 

For protein profiling, the main decision is 
whether to invest in the IMC-MIBI ecosystem 
or opt for cyclic immunofluorescence proto- 
cols. IMC and MIBI are commercially available, 
robust, and (for IMC) widely used. Although 
antibody conjugation and validation can be 
time consuming, a growing catalog of pre- 
conjugated reagents is available, and there are 
many examples of successful studies that use 
this technology, including some large-scale 
ones (8, 77-22). However, detection maxes out 
at fewer than 50 different markers, sensitivity 
is limited, and amplification methods are not 
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Fig. 4. Spatial barcoding methods. All methods are based on the production of 
a solid surface with a regular array of capture probes (DNA oligonucleotides), 
terminating with a poly(T) sequence that matches cellular mRNAs. In ST and 
10X Visium, the capture probes are built directly by means of microarray 
printing, resulting in a resolution of 50 to 100 um. In Slide-seq and HDST, the 
probes are synthesized on gel beads, which are then randomly attached to a 
slide or deposited on a microwell chip. The position of each bead is then detected 
by short rounds of ISS. In Seq-Scope and PIXEL-seq, the capture probes are 
produced by modifying sequencing clusters produced by bridge amplification on 
an Illumina flow cell or on a polyacrylamide-based hydrogel, whereas in Stereo- 


yet available. Moreover, imaging time and cost 
are substantial, requiring a large investment in 
both instruments and reagents. Last, resolution 
is relatively coarse on IMC and generally insuf- 
ficient to detect any subcellular structure beyond 
nuclei and membranes. MIBI offers a higher res- 
olution, but at the cost of even slower processing. 

Cyclic immunofluorescence has a much lower 
entry barrier because of its lower cost, and it 
offers higher throughput. It can detect more 
than 100 features given enough time and anti- 
body availability. Moreover, some incarna- 
tions of the technology are compatible with 
native antibodies, reducing the need for op- 
timization. Last, resolution is much higher, 
both in the vy and z dimensions (with con- 
focal imaging options available). On the other 
hand, compared with MS, fluorescence imag- 
ing has a lower SNR, which makes some mark- 
ers difficult to detect. Cyclical imaging also 


Bressan et al., Science 381, eabq4964: (2023) 


requires nontrivial data processing because 
multiple images need to be aligned, and tissue 
damage can become substantial after multi- 
ple cycles. 

Considerations are much more complex in the 
spatial transcriptomics area. If very high resolu- 
tion is needed—for example, to resolve transcripts 
in different subcellular compartments—imaging- 
based methods (whether hybridization or se- 
quencing based) are the clear winners. High 
resolution, however, makes them slow; imaging 
time scales with both sample area and number 
of target features (because more cycles are nec- 
essary), and the very small field of view neces- 
sitates extensive tiling, image post-processing, 
and registration. 

Of the two subcategories, hybridization meth- 
ods have better sensitivity but tend to produce 
weaker fluorescence signals (although ampli- 
fication strategies are available) and may re- 
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seq, they are produced by means of RCA on a silicon wafer. Spatial location is 
provided by ISS of the capture probes. In all cases, a tissue section is then placed 
on top of the arrayed capture probes and digested, letting the mRNAs in the 
tissue reach the poly(T) region, and cDNA is synthesized in situ. In DBiT-seq, 
barcodes are created directly on top of biomolecules (DNA fragments, cDNA, or 
oligo-conjugated antibodies) by placing a microfluidic chip with parallel thin 
channels on the section and performing ligation with a different oligonucleotide in 
each channel. The chip is then rotated by 90°, and ligation is repeated, producing 
an xy grid with resolution ranging from 10 to 50 um. In all cases, the indexed 
biomolecules are then profiled in bulk by means of sequencing. 


quire reoptimization for different tissues owing 
to different optical properties. They also perform 
best with relatively thin sections (10 to 20 um). 
In situ sequencing protocols, on the other hand, 
unvariably use signal amplification and produce 
stronger signals, require less optimization for 
different tissue types, and can image thicker 
samples (up to 100 um or so). However, in situ 
sequencing exhibits the lowest sensitivity of all 
methods considered here, and when signal am- 
plification is used to improve the SNR of hy- 
bridization methods, sensitivity tends to also 
decrease. This might not be a deal breaker be- 
cause a sparse sampling of the transcriptome 
can still be informative as long as relative dif- 
ferences in expression are maintained. How- 
ever, some of the most useful genetic markers 
(such as transcription factors) are expressed at 
very low levels and may be hard to detect with 
methods having low sensitivity. 
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Untargeted profiling without the need for a 
predesigned probe panel is another advantage 
of in situ sequencing methods (especially ExSeq, 
which offers full-length reads), allowing for ac- 
curate quantification of splicing isoforms and 
mutations. These methods also offer whole- 
transcriptome profiling. The sensitivity of un- 
targeted methods, however, tends to be lower 
than that of targeted ones because of the low 
efficiency of in situ RT. Also, recent hybridization- 
based methods effectively profile thousands of 
genes, enough to identify cell types, states, and 
pathway activity and extrapolate the expression 
of most of the remaining transcriptome (83). 

Resolution and throughput are also critical 
parameters to consider for some barcoding- 
based methods. For the most part, these meth- 
ods cannot associate a spatial barcode with a 
specific cell (because barcodes are randomly 
distributed on the slide relative to cell positions, 
and often overlap cell-cell boundaries). The 
most widely used technology at the moment, 
ST, has a low resolution limited to 20 to 100 
cells per barcode, which makes it substan- 
tially harder to profile spatial neighborhoods 
in detail. Other protocols reach resolutions 
ranging from 10 um per barcode (Slide-seq) to 
~500 nm per barcode (PIXEL-seq, Seq-Scope, 
and Stereo-seq), and microfluidics-based bar- 
coding (DBIT-seq) also reach 10 um. None of 
these solutions, however, is “cell aware”. Fur- 
thermore, sensitivity is relatively low (mainly 
because of the need for in situ RT), and cost per 
sample is generally higher than that of other 
methods. However, barcoding methods offer 
considerable advantages in terms of through- 
put because the acquisition time does not scale 
with either the sample size or the number of 
features detected, and multiple samples can be 
processed in parallel and sequenced together 
in bulk. 

Last, laser capture dissection or spatial tag- 
ging methods such as TIVA or PIC should be 
considered when deep profiling of just a few 
spatial locations is needed. These methods are 
often simpler to implement and allow direct 
purification of selected material for analysis 
through a variety of bulk profiling methods. 


Big data to useful data—the challenge 
of analysis 


Data acquisition is often assumed to be the 
most difficult step in spatial omics methods. 
However, data manipulation, analysis, and vis- 
ualization are just as important if not more so. 
Spatial profiling methods generate such large 
datasets (often many terabytes), have influ- 
enced and biased by so many factors, that an 
experiment that costs thousands of dollars can 
be rendered completely useless without the 
appropriate analytical tools. The raw data are 
sometimes so large that they cannot be properly 
visualized without specialized hardware and 
software. 
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Some key steps of the analysis pipeline for 
each technology are illustrated in Fig. 5. Meth- 
ods based on imaging face the immediate 
challenge of realigning images across the dif- 
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- spatially variable genes 


ferent image cycles and of stitching multiple 
fields of view into a coherent mosaic. Image 
registration is a nontrivial problem that has 
been extensively reviewed elsewhere (84, 85), 
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made more complex in spatial profiling by the 
very large volume of data. Most methods resort 
to using easily identifiable “fiducial landmarks” 
(fluorescent beads) as features to calculate the 
registration matrix (Fig. 5), which make the 
registration much easier at the cost of addi- 
tional experimental processing. In addition, 
most transcriptomic methods require a decod- 
ing step in which the sequence of fluorescent 
signals is matched to one of the genes in the 
panel, or resolved as a de novo sequence, and 
error correction and detection is applied. These 
processing pipelines need to be tailored to ac- 
count for twists and aberrations specific to each 
technique or microscope and thus are diffi- 
cult to standardize and optimize. Recently, the 
SpaceTx consortium made a substantial effort 
by releasing Starfish, a set of python modules 
successfully appliable to most methods (86), al- 
though generally underperforming each meth- 
od’s dedicated pipelines. 

Although most image-based protocols have 
subcellular resolution, almost all downstream 
analysis operates at the single-cell level. Once 
a molecule is detected in space, it needs to be 
assigned to a cell, and all the measurements 
taken over a cell area need to be integrated 
into a single abundance score for each gene or 
protein, ultimately creating a cell-by-feature 
matrix. To do this, raw images are segmented 
to identify areas that belong to individual cells. 

Image segmentation is a central problem in 
both bioimaging and computer vision and has 
benefitted in the past few years from rapid 
advances in artificial intelligence (AI) and deep 
learning (87-89). Although nuclei segmenta- 
tion benefits from the regular nuclear shape 
and the availability of a universal stain (DNA 
intercalators), to define cytoplasmic membrane 
boundaries is a more complex task. Membrane 
markers often generate low signal, have poor 
specificity for the cytoplasmic membrane, or 
are not generic for all cell types and thus 
cannot produce effective segmentation even 
when AI models are used. Although combin- 
ing multiple labels has shown some good re- 
sults (4), the development of a true universal 
membrane marker for segmentation would be 
transformative for the spatial omics field. 

Spatial barcoding methods bypass most im- 
age analysis requirements because the data 
are in the same form as traditional sequencing 
methods, and it is already naturally matched to 
a specific location. Processing closely resembles 
genomics pipelines and is more standardized 
and portable, which represents a major advan- 
tage for this type of methodology. However, the 
“locations” to which barcoded reads are mapped 
do not necessarily correspond to specific single 
cells, and therefore the results are not a cell-by- 
feature matrix but correspond to “bins” of var- 
ious size. In high-resolution methods such as 
Seq-Scope or PIXEL-seq, multiple bins can be 
summed to produce a cell-level database upon 
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segmentation, which of course brings back 
the requirement for image analysis. 

Once the initial processing is completed, the 
dataset is in a form comparable with that of 
disaggregated single-cell profiling methods, 
with the addition of spatial coordinates for 
each cell, and often with substantially higher 
cell counts. Tools designed for disaggregated 
data [such as the Seurat, ScateR, Scanpy, and 
Monocle packages (90-94)] can provide good 
results but need to be used cautiously because 
the nuances of data generation can cause biases. 
Some tools have been upgraded to handle spatial 
data, introducing data structures capable of hold- 
ing coordinates and the ability to produce spa- 
tial plots and graphs. Dedicated packages have 
also begun to emerge, often integrating direct 
visualization of the raw data (for imaging-based 
methods) with analysis of the cell-by-feature 
matrix [for example, Giotto VitesSce, Histocat, 
Cytomapper, SquidPy, and others (95-99)]. Com- 
mercial options are also available. 

Although standard dimensional reduction, 
clustering, and differential expression tools can 
operate effectively on spatial data, they fail to 
effectively leverage the most important feature 
of spatial profiling methods: space. The rapid 
development of spatial omics methods with 
ever-increasing resolution, throughput, and 
power has run ahead of the development of 
statistical frameworks and analysis tools capa- 
ble of harnessing the data to generate new 
insights. This void is now starting to fill, with 
new algorithms being made available almost 
weekly. Although a review of this area would 
exceed the space available, and has been done 
in more detail elsewhere (100), there are three 
main areas of advancement: (i) identification 
of spatially variable markers and of specific 
expression patterns, through methods such as 
Gaussian process regression, spatial autocor- 
relation, and adversarial neural networks 
(101, 102); (ii) identification of spatial niches 
characterized by a set of coexpressed markers, 
at the level of multicellular areas or as a re- 
placement or refinement for cell segmenta- 
tion, under the assumption that cells could 
be defined as areas with a relatively uniform 
gene expression compared with other cells 
(103, 104); and (iii) identification of spatial 
interactions between different cell types (tis- 
sue neighborhoods), detecting whether two 
cell types are likely to colocalize in certain 
areas or conditions, and whether this corre- 
lates with tissue function or disease (105). This 
can be complemented by ligand-receptor an- 
alysis, a measurement of cell-cell functional 
interactions already introduced in disaggregated 
scRNA-seq (106-108), which really shines when 
combined with cell-cell proximity information. 


The challenge of experimental design 


As the saying goes, “when you have a hammer, 
everything looks like a nail.” Spatial profiling 
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technologies are wonderful new instruments, 
and it is tempting to apply them to every prob- 
lem just because one can. To move forward, 
however, users will need to carefully formulate 
questions that can truly benefit from spatial 
omics and address them with experimental 
plans with a solid statistical foundation. 

One application is the identification of new 
cell types and states within tissues, under 
both physiological and pathological conditions. 
This is the professed objective of the Human 
Cell Atlas project, which was initially focused 
on disaggregated methods but now is shifting 
to spatial omics technologies (3). These meth- 
ods can reveal residual heterogeneity in gene 
or protein expression not accounted for by 
traditional cell-type definitions and correlate 
it with spatial location. Another appropriate 
question for these methods is whether colo- 
calization of different cell types influences their 
expression profile or correlates with tissue 
function or disease outcome. This is the focus 
of several recent large-scale projects, investi- 
gating, for example, how tumors evolve and 
respond to treatment (17, 21, 22). The ability 
to simultaneously profile many genes and pro- 
teins also facilitates discovery of new markers 
for specific spatial niches. Because these technol- 
ogies can produce detailed molecular profiling 
of tissues in their native state—without ar- 
tefacts from dissociation, lysis, and averaging 
of different cells—they enable mechanistic in- 
vestigation in situ of phenomena that could 
previously be approached only in simplified 
models or in a purely descriptive way. 

Cell types, or states, characterized by a con- 
sistent marker profile can often be identified 
in spatial data by using the same tools used for 
disaggregated methods. These tools may be 
sufficient to define previously unknown pop- 
ulations even before location is considered. If 
the sample shows spatial consistency, cells of 
the same type can be manually pooled on the 
basis of prior knowledge of tissue anatomy— 
for example, comparing the abundance and 
expression profile of particular neurons in dif- 
ferent cortical areas (5). If the spatial organiza- 
tion of the tissue is not consistent, a potential 
solution is to use spatial information to extract 
neighborhood scores or define “neighborhood 
clusters.” Scores reflect the frequency of con- 
tacts between different cell types, whereas clus- 
ters are assigned cells on the basis of the type of 
other cells they contact or their presence with- 
in a given range. These parameters are often 
directly comparable across samples even if the 
spatial structure itself is different. This approach 
has shown, for example, that colocalization of 
cells of different types in breast cancer has prog- 
nostic value (8). 

One challenge in using spatial omics meth- 
ods to characterize cell types is the limited 
number of features measured at each loca- 
tion. One strategy to overcome this limit is to 
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combine spatial measurements with disag- 
gregated single-cell methods, which often offer 
deeper profiling. The assumption behind such 
“data integration” is that a limited number of 
markers is sufficient to map each cell or location 
detected in the spatial method onto a cell class 
or state in the disaggregated data, and the ad- 
ditional measurements taken in the latter can 
then be reprojected in space. This approach 
has been used for both protein measurements, 
matching IMC and CyTOF (cytometry by time 
of flight) data (109), and RNA measurements 
(4). The latter case is particularly interesting 
because the authors were able to impute the 
values missing in the spatial data (in this case, 
a developing mouse embryo), producing a 
dataset that included more than 10,000 gene 
measurements per cell. Key to this integration 
is a well-chosen set of overlapping markers 
to join the two technologies. Currently, these 
are often chosen through manual curation or 
by identifying the most relevant cluster-specific 
features from disaggregated data. However, 
cluster-free a priori methods to identify optimal 
markers have recently been proposed (1/0). 
Notably, data integration is only possible for 
samples for which identical (or very similar) 
samples can be used for both disaggregated 
and spatial methods and in which both meth- 
ods are likely to capture the whole heterogeneity 
of cell types and states present in the sample. 

Replication is a central issue for much of 
the field, yet only possible if there is some 
degree of consistency in the spatial structure 
of samples. When studying tissues with a 
stereotypical organization, such as a develop- 
ing embryo or the brain cortex, it is relatively 
simple to acquire multiple replicates and aver- 
age the results. This is a challenge for path- 
ological tissues (such as tumors) that have a 
highly heterogeneous, and not consistent, spa- 
tial structure. To further complicate things, as 
data resolution increases, sample-to-sample 
heterogeneity is revealed even in tissues with 
a seemingly repeatable structure, up to the 
extreme level of every sample being different, 
making replication impossible. This issue is 
already known to the field of disaggregated 
single-cell sequencing, which struggles with 
where to draw the line between one cell type 
or state and another. 

In most cases, there is not yet a strong rec- 
ommendation on the ideal number of repli- 
cates. Although published studies often use 
the “golden rule” of three or more replicates, a 
better option for large studies would be to first 
define the experimental parameters being 
compared, then begin with a small set of pilot 
experiments to estimate their variance across 
samples, and last, enroll the help of a trained 
statistician to calculate a sample size sufficient 
to produce a sound result. One must also con- 
sider the economic feasibility of big sample 
sizes. Application of spatial omics to large- 
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scale projects is still hindered by the cost and 
time investment required by many technolo- 
gies. This will undoubtedly be mitigated by 
further development and broader commercial 
diffusion of the technologies. 

Studying rare or poorly characterized phenom- 
ena (such as micrometastases in distal organs, 
or rare or transient cell types in development) 
presents a different challenge: acquiring a suf- 
ficient number of samples. Because replication 
is a necessity, it is necessary in these cases to 
prescreen tissues with a limited number of rel- 
evant markers, ideally by using a fast, low- 
plex technology such as whole-slide scanning 
or 3D whole-organ microscopy. Areas of interest 
can thus be identified and prioritized for sub- 
sequent detailed molecular analysis. This has 
been done, for example, by using serial two- 
photon tomography (///, 1/2). The best exper- 
imental designs for spatial omics studies thus 
often include different but complementary 
technologies. 


Future directions 


We are still at the dawn of the spatial omics 
era, and the future undoubtedly holds just as 
many revolutions and advancements as the 
past few years have seen. 

Spatial profiling methods are going to be- 
come faster, more reliable, and more powerful 
with time. This will likely go hand in hand with 
amore extensive offer of commercial instru- 
ments and reagents. A handful of companies 
already offer “turnkey” solutions, with more 
set to launch. Commercial adoption in turn 
will facilitate wider access to these technol- 
ogies and contribute to the democratization 
of spatial omics. 

Profiling depth is likely to continue increas- 
ing. Imaging-based RNA measurement meth- 
ods have already reached whole-transcriptome 
levels, at least in cell culture, and this trend 
will likely extend to tissues. For proteins, whole- 
proteome measurements are presently unat- 
tainable, but the number of measurable markers 
will likely expand from less than 50 to hundreds, 
although this might require a move beyond 
MS-based methods and production of a large 
number of highly specific antibodies. Although 
ultradeep measurements will be possible, they 
will likely remain confined to special-use cases 
because garnering additional data must be 
balanced against increased experimental com- 
plexity. Untargeted in situ sequencing and spa- 
tial barcoding methods, which will no doubt 
improve in both specificity and resolution, will 
continue to represent a very valuable alternative, 
and the latter might well become the leading 
solution for large-scale studies. 

We also anticipate a further extension of ex- 
isting technologies to a wider range of mea- 
surements. Most existing methods profile either 
mRNA abundance or protein expression. More 
recently, RNA quantification has been achieved 
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with multiplexed immunostaining methods 
that use fluorescent or metal conjugated probes 
(16), and conversely, oligonucleotide-conjugated 
antibodies have been used in both cyclic hy- 
bridization (48) and spatial barcoding meth- 
ods (76). 

Genomic and epigenomic measurements 
are also increasingly performed spatially. 3D 
chromatin organization has been profiled 
genome-wide, alongside gene expression, by 
using both seqFISH+ and MERFISH (/73, 114), 
and the latter has also been combined with the 
Cut&Tag protocol (175) for spatial profiling of 
epigenetic marks (1/6). Spatial barcoding meth- 
ods such as microfluidic barcoding and Slide- 
seq have been combined with both Cut&Tag 
and ATAC-seq (assay for transposase-accessible 
chromatin with high-throughput sequencing) 
(117) for spatial profiling of open chromatin 
and histone modifications (77, 78, 118). Last, 
detection of point mutations is possible by 
using ISS of DNA (179). 

Although these different profiling modal- 
ities are still mostly performed one or two at 
a time (proteins + transcriptome, or tran- 
scriptome + genome organization), the future 
will unquestionably see a rise in spatial multi- 
omic technologies, mimicking the evolution of 
disaggregated single-cell technologies. Ulti- 
mately, a technology that incorporates genomic, 
transcriptomic, and proteomics-small-molecule 
measurements is desirable. 

Last, the field will move closer to true 3D 
profiling. Currently, most spatial omics oper- 
ate on thin tissue slices, which are essentially 
2D. In the case of imaging-based profiling, the 
challenge is diffusing probes and antibodies 
through thick samples and obtaining suffi- 
cient SNR during fluorescence imaging. With 
barcoding methods, the oligonucleotides are 
placed in a 2D layer, and thick samples would 
result in multiple cells being captured for each 
position. Although 2D profiling is incredibly 
useful, it is blind to what lays outside of the 
sectioning plane and cannot produce an ac- 
curate image of some tissue niches. Only some 
multiplexed IHC and ISS techniques come 
close to 3D profiling (albeit only for samples 
a few hundreds of micrometers thick) because 
of the presence of strong amplification pro- 
ducing sufficient signal for confocal microscopy. 

To enable 3D profiling more generally, one 
solution is to produce serial thin sections 
from a 3D sample, process each one through 
one of the 2D technologies, and use computa- 
tional methods to realign the data and produce 
a 3D cube. This approach is very expensive in 
terms of processing time, but it is achievable 
with current technologies and has been dem- 
onstrated for both IMC alone, IMC combined 
with serial two-photon tomography, and spatial 
transcriptomics (111, 120, 127). Other approaches 
to 3D profiling will no doubt be developed in 
the coming years. 
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Just a few years ago, the ability to map the 
molecular make-up of every cell in a tissue, and 
interrogate hundreds of markers and millions 
of cells in the same experiment, would have 
been considered more science fiction than sci- 
ence. Now, single-cell and spatial omics tech- 
nologies are advancing at breakneck speed and 
are granting scientists new powers to inves- 
tigate complex biological processes. With this 
power comes the responsibility of assessing 
the limitations of this new breed of methods, 
their ideal application niches, and the best 
way to apply good experimental practices and 
rigorous analysis to the data that is produced. 
As more and more laboratories get access to 
these new-generation techniques, there is no 
doubt that incredible biological insights are 
just around the corner. 
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INTRODUCTION: Gut microbiota can regulate 
the physiology and pathophysiology of the 
host by producing enzymes with functions simi- 
lar to those of the host. However, it is difficult to 
identify these microbial-host isozymes through 
sequencing-based studies because enzymes with 
similar functions in different species may lack 
sequence conservation. An activity-based func- 
tional protein screening framework is more re- 
liable for the discovery and characterization of 
such microbial-host isozymes, which will help 
yield deeper insights into the gut microbiota- 
host cross-talk. 


RATIONALE: To identify potential microbial-host 
isozymes, we set up an enzyme activity screening 
platform, including activity assays for 110 
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enzymes that are functional in various human 
diseases. These enzyme activities were measured 
in stool-derived ex vivo bacteria communities. 
Dipeptidyl peptidase 4 (DPP4) was a prominent 
microbial-host isozyme identified in our screen, 
but little is known about its pathophysiological 
effects on the host. We sought to determine 
whether gut microbial-derived DPP4, like host 
DPP4: (hDPP4), could decrease active GLP-1 and 
thus affect blood glucose homeostasis. 


RESULTS: We identified 71 enzymes with posi- 
tive activity in the human gut bacteria com- 
munities through our enzyme activity screening 
platform, most of which were validated in the 
protein extracts obtained from feces of germ- 
free and specific pathogen-free mice. Among 


these identified enzymes, DPP4 activity had the 
highest statistical effect size (Z factor) among the 
10 human samples. Through human gut bac- 
teria isolation and DPP4 activity screening, we 
discovered that microbial DPP4 was mainly 
produced by Bacteroides spp. Gut microbial 
DPP4: (mDPP4) could degrade active GLP-1(7-37) 
in vitro. However, mDPP4 could not affect active 
GLP-1 levels in chow-fed mice but could de- 
crease active GLP-1 activity and impair glucose 
homeostasis in high-fat diet (HFD)-fed mice or 
dextran sulfate sodium/indomethacin-treated 
mice, suggesting that a damaged gut barrier is 
required for mDPP4 to affect the activity of 
host GLP-1. 

We discovered that the clinical DPP4 inhibitor 
sitagliptin failed to efficiently inhibit mDPP4. 
And by solving the co-crystal of mDPP4 with 
sitagliptin at 1.97-anstrom resolution, we found 
differences in the nature of the binding be- 
tween the drug and mDPP4 compared with 
its binding to hDPP4 that may explain this 
difference in inhibitory effects. A sitagliptin 
clinical trial (www.clinicaltrials.gov identifer 
NCT04495881) among patients with type 2 
diabetes (T2D) (n = 57) and a related fecal 
microbiota transplant of stool from high res- 
ponders and low responders in the present study 
to HFD-fed mice demonstrated that mDPP4 
could limit the efficacy of sitagliptin in indi- 
viduals with T2D and in glucose-intolerant mice. 

To identify a selective inhibitor of mDPP4, 
we screened ~107,000 compounds, and using 
structural modification we identified Dau-d4, 
a derivative of daurisoline, that could selectively 
inhibit mDPP4 activity compared with hDPP4. 
Dau-d4 could increase active GLP-1 levels and 
improve glucose metabolism in diabetic mice, 
and co-administration of Dau-d4 with sitaglip- 
tin further improved blood glucose homeostasis. 


CONCLUSION: Here, we developed an activity- 
based strategy to identify uncharacterized gut 
microbial-host isozymes that provides a deeper 
understanding of gut microbiota-host interac- 
tions. Gut microbial DPP4 isozyme can impair 
host glucose homeostasis, and variations in 
microbial DPP4 activities could possibly con- 
tribute to the heterogeneous responses to 
sitagliptin observed among patients with T2D. 
Our findings highlight the promise of de- 
veloping therapies that target both host and 
gut microbial enzymes to achieve greater 
clinical efficacy. 
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A mechanistic understanding of how microbial proteins affect the host could yield deeper insights 
into gut microbiota—host cross-talk. We developed an enzyme activity—screening platform to 
investigate how gut microbiota—derived enzymes might influence host physiology. We discovered 
that dipeptidyl peptidase 4 (DPP4) is expressed by specific bacterial taxa of the microbiota. 
Microbial DPP4 was able to decrease the active glucagon like peptide-1 (GLP-1) and disrupt glucose 
metabolism in mice with a leaky gut. Furthermore, the current drugs targeting human DPP4, 
including sitagliptin, had little effect on microbial DPP4. Using high-throughput screening, we 
identified daurisoline-d4 (Dau-d4) as a selective microbial DPP4 inhibitor that improves glucose 


tolerance in diabetic mice. 


he gut microbiota contributes to host 
normo- and pathophysiology by producing 
diverse metabolites (7-3) and a variety of 
proteins and peptides (4-9). Although the 
various functions of most gut microbiota- 
derived proteins have not been identified, 
several studies have shown that the gut micro- 
biota can produce enzymes with functions 
similar to those of the host (microbial-host 
isozymes) (10-13). However, proteins with 
similar functions in the host and in different 
bacterial species, so-called isozymes, may lack 
sequence conservation, and thus sequencing- 
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based studies to identify such isozymes may 
not be a reliable means to identify them or their 
biological functions. 

To better identify microbial-host isozymes, 
we developed an activity-based screening plat- 
form as a framework for their discovery and 
characterization that may also represent targets 
for disease intervention because many of the 
enzymes in our screen are known to also be 
involved in host pathophysiology. In our screen, 
we identified isozymes from the human and 
mouse gut microbiota, including dipeptidyl 
peptidase 4 (DPP4), an important therapeutic 
target for the management of type 2 diabetes 
(T2D). DPP4 is responsible for the degradation 
of GLP-1(7-36 amide) or GLP-1(7-37), both of 
which are called “active” GLP-1 and are secreted 
from intestinal enteroendocrine L cells that play 
an important role in the coordination of post- 
prandial glucose homeostasis, mainly through 
the activation of GLP-1 receptor (GLP-1R) (4, 15). 
The degradation products of active GLP-1, 
GLP-1(9-36 amide) or GLP-1(9-37), were unable 
to activate GLP-1R, leading to the generally con- 
sidered inactive form of GLP-1 (16). The sum- 
mation of the pools of active GLP-1 and inactive 
GLP-1 equals the pool of total GLP-1. 

Homologous proteins of human DPP4 with 
different amino acid sequences have been found 
in many other organisms, including bacteria 
and fungi. For example, DPP4 has been iden- 
tified by others in periodontopathic bacteria, 
and when it was injected into mice, their glucose 
tolerance was altered, indicating that the micro- 
bial DPP4 was functionally similar to host DPP4 
(7, 18). However, current clinical DPP4 inhibi- 
tors, which target human DPP4, exhibit low 
inhibitory activity toward periodontopathic 
bacterial DPP4 (79, 20). Although it has been 
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speculated that the gut microbiota has DPP4- 
like activity (21), it is not known which specific 
gut commensal bacteria produce DPP4 iso- 
zymes or what effects such gut microbiota- 
derived DPP4 may have on host health. In our 
work, we discovered that in mice with a damaged 
gut barrier, gut microbial DPP4 decreased the 
pool of active GLP-1 independently of host 
DPP4, thus impairing host glucose homeosta- 
sis. Clinically relevant DPP4 inhibitors, includ- 
ing sitagliptin, that are used to treat T2D have 
a weak inhibitory effect on gut microbial DPP4, 
which may explain the variable response to 
sitagliptin in certain T2D patients. Using high- 
throughput screening, we identified Dau-d4, a 
derivate of daurisoline (Dau), as a specific micro- 
bial DPP4 inhibitor that ameliorated metabolic 
dysfunction in mice mainly through its action 
on the microbial DPP4-host GLP-1 axis. 


Results 
Functional screening for microbial-host isozymes 


We established a screening platform to iden- 
tify microbial-host isozymes that have the poten- 
tial to affect host function (Fig. 1A). A major 
challenge with a mixed fecal culture approach 
is maintaining the diversity and composition 
of the microbiota during ex vivo culturing 
(13, 22-24). We collected fresh human fecal 
samples from three healthy volunteers and 
cultured them for 2 days in 10 different types 
of media commonly used in gut microbiota 
cultivation, followed by 16S ribosomal RNA 
(tRNA) gene sequencing. We found that brain 
heart infusion (BHI) medium supported the 
growth of the bacterial community while main- 
taining a composition and diversity similar to 
those observed in the fecal samples (fig. S1, A 
and B) (22). To reduce overgrowth of facul- 
tative anaerobes such as Enterococcus and 
Escherichia-Shigella (13, 22), we selected fluo- 
romethalone as a supplement for BHI (mBHI 
medium) (fig. $1, C and D) (25). By collecting 
and culturing 10 fresh stool samples in mBHI 
medium, we found that the stool-derived ex vivo 
communities were maximally similar to the 
original fecal microbiota (fig. SIE). 

We selected 110 enzymes that are functional 
in various human diseases, including US Food 
and Drug Administration-approved drug tar- 
gets and other disease target enzymes in the 
human key drug-target database SuperTarget 
(26) (fig. SIF). Protein extracts from the above 
10 stool-derived ex vivo communities were pre- 
pared for the activity of microbial-host isozymes 
(see the materials and methods for details). A 
combination of fecal enzymes may be a better 
representation of the in sitw enzyme activity 
of the microbiota than a separation-based 
single enzymatic activity assay. Among 10 tested 
human stool-derived ex vivo communities, we 
identified 51 enzymes with a statistical effect 
size (Z factor) of >0.5 in five or more individual 
samples and 20 with a Z factor >0.5 in one to 
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four individual samples (Fig. 1B). It should be 
noted that discrepancies in the activity of the 
same enzymes between individuals may re- 
sult from differences in the composition and 
content of their gut microbiota. 

We also measured the activities of above 71 
potential microbial-host isozymes from fecal 
samples obtained from germ-free (GF) and con- 
ventionally raised (CONV-R) mice. We found that 
the enzymatic activity of 56 (78.9%) microbial- 
host isozymes was higher in feces from CONV-R 
mice than in that from GF mice (fig. S1G). 


Microbial DPP4 decreases active GLP-1 under 
leaky gut 


In our screen, we found that DPP4 had the 
highest average Z factor among the 10 human 
samples (Fig. 1B). We also found that the acti- 
vity of this enzyme in the feces of GF mice was 
less than that of the enzyme extracted from 
CONV-R mice (5.5-fold; fig. SIG). Several DPP4: 
inhibitors have been developed and used in 
the clinical management of T2D (J4, 27), but 
their clinical efficacy varies among patients 
for unknown reasons (28). To assess the physio- 
logical function of microbial DPP4 in vivo, 
chow-fed specific pathogen-free (SPF) mice 
were orally treated with a broad-spectrum anti- 
biotic cocktail for 1 day (Fig. 1C). The efficiency 
of gut bacteria removal was monitored by mea- 
suring the relative abundance of the 16S rRNA 
gene (fig. S2A). Although antibiotic treatment 
significantly decreased fecal DPP4 activity (fig. 
S2B), no changes in DPP4 activity or host DPP4 
concentrations were observed in the extralu- 
minal intestinal tissue (Fig. 1D and fig. S2C). 
This is important because it has been reported 
that intestinal DPP4 activity is required to 
degrade GLP-1 and thus regulate glucose toler- 
ance (29). Moreover, active and total GLP-1 levels 
in the extraluminal intestinal tissue and plasma 
of chow-fed mice did not change after antibiotic 
treatment (Fig. 1, E and F, and fig. S2, D and E). 
In the intact gut of chow-fed mice, GLP-1 is 
secreted from L cells of the intestine and is 
released basolaterally directly into the blood- 
stream (15). Thus, gut microbial DPP4 may 
decrease GLP-1 when it crosses the intestinal 
barrier. It is known that high-fat diet (HFD) 
feeding can induce lipotoxicity and altera- 
tions in the shape and distribution of the gut 
microbiota (i.e., gut dysbiosis), which can 
damage the intestinal barrier. Such a leaky 
gut induced by HFD feeding allows microbial 
macromolecules such as lipopolysaccharide 
and bacterial proteins into the bloodstream 
(30-33). Therefore, we speculate that a HFD- 
induced leaky gut allows microbial DPP4 to 
decrease the activity of host GLP-1. 

To investigate this theory, we tested the effect 
of microbial DPP4 on HFD-fed mice (Fig. 1C). 
After a 12-week HFD feeding regimen, which 
increased intestinal barrier permeability (as 
indicated by a fluorescein isothiocyanate- 
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dextran permeability assay and changes in 
the gene expression of tight junction proteins; 
fig. S2, F and G), the antibiotic treatment 
group showed significantly less DPP4: activity 
Gncluding both host and microbial DPP4) in 
their feces and extraluminal intestinal tissue 
compared with control mice given antibiotic- 
free drinking water (Fig. 1G and fig. S2, H and 1, 
with no change in intestinal host DPP4 levels 
(fig. S2J). When lower DPP4 activity was de- 
tected, active GLP-1 levels were higher in extra- 
luminal intestinal tissue and plasma compared 
with the control group (Fig. 1, H and I), whereas 
total GLP-1 levels remained unchanged (fig. S2, 
K and L). Fecal microbiota transplantation 
(FMT) from SPF mice to antibiotic-treated mice 
reversed the effect of antibiotics on DPP4 
activity and the active GLP-1 levels (Fig. 1, G to I, 
and fig. S2, H to L). DPP4 globally deficient 
(Dpp4'-), HFD-fed mice (Fig. 1J) retained 
DPP4 activity in the extraluminal intestinal 
tissue because of the presence of gut microbial 
DPP4 along with a leaky gut despite the loss of 
host DPP4 protein. Conversely, DPP4: activity 
was absent when the Dpp4! ~ mice were treated 
with antibiotics, which correlated with greater 
pools of active GLP-1 in both the extraluminal 
intestinal tissue and the plasma compared with 
the control-treated Dppt! ~ mice (Fig. 1, K to M, 
and fig. S2, M to Q). 

To confirm that microbial DPP4-mediated 
decrease of active GLP-1 requires increased 
intestinal permeability, we gave chow-fed mice 
a low concentration (1%) of dextran sulfate 
sodium (DSS) (34). DSS carries a strong neg- 
ative charge that induces erosion of the gut 
epithelium, so we used it to test whether this 
source of damage to the gut, rather than HFD 
feeding per se, allows microbial DPP4 to escape 
the gut lumen (fig. S3, A and B). We found that 
antibiotic treatment in DSS-treated mice showed 
similar changes in DPP4 activity and active 
GLP-1 levels compared with the HFD-fed mice 
(fig. S3, C to K). Moreover, in an indomethacin 
drinking model, another mouse model of in- 
creased intestinal permeability caused by the 
inhibitory effect of indomethacin on protec- 
tive prostaglandins (35) (fig. S3, L and M), we 
observed similar changes in DPP4 activity and 
active GLP-1 levels as with the DSS and HFD 
models (fig. S3, N to V). Therefore, in mice with 
a damaged gut barrier, the gut microbiota 
appears to contribute to DPP4 activity and to 
the decrease of active GLP-1 independently of 
host DPP4. Gut microbial DPP4 thus appears 
to be able to reduce active GLP-1 levels in vivo 
under conditions similar to the leaky gut that 
often occurs in patients with T2D (36, 37). 


Identification of DPP4-producing bacteria 


To identify microbial producers of DPP4, we 
collected fresh stool samples from 19 healthy 
volunteers, cultured the microbial commun- 
ities ex vivo, and tested DPP4 activity. We chose 
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two samples with the highest DPP4 activities 
from which to isolate and identify bacterial 
strains (fig. S4A). We established a library of 
272 culturable isolates representing six phyla 
(Fig. 2A and fig. S4B). By using the GLP-1 mimics 
as assay substrates, we detected potent DPP4 
activity among the Bacteroidetes phylum (Fig. 
2A), including Bacteroides thetaiotaomicron, 
B. fragilis, B. eggerthii, B. vulgatus, and B. dorei 
(Fig. 2A). Among these, B. thetaiotaomicron 
showed the greatest activity with His-Ala-p- 
nitroaniline and Gly-Pro-p-nitroaniline (Fig. 2A). 

Phylogenetic analysis was performed on 
24 proteins, including 19 DPP4 homologs from 
the above Bacteroides spp., previously reported 
DPP4 homologs from periodontal bacteria 
(7, 18), and human DPP4: (fig. S4C). We ex- 
pressed and purified 14 candidate enzymes 
and measured their DPP4 activity (Fig. 2B, 
fig. S4D, and table S1). Although sequence 
alignment suggested that 19 enzymes in five 
Bacteroides spp. were homologs to human 
DPP4 (hDPP4), not all the bacterial gene pro- 
ducts were functionally similar to hDPP4. 
B. thetaiotaomicron peptidase 1, B. fragilis 
peptidase 1, B. eggerthii peptidase 1, B. vulgatus 
peptidase 1, and B. dorei peptidase 1 (btDPP4, 
bfDPP4, beDPP4, byDPP4, and bdDPP4, respec- 
tively) showed potent DPP4 activity, whereas 
others had no significant DPP4 activity (Fig. 
2B). All of the Bacteroides spp. DPP4: fell into 
a clade distinct from that of oral bacterial 
DPP4 (fig. S4C). Sequence analysis also showed 
that the Bacteroides DPP4 protein homologs 
contain an N-terminal signal peptide, a dipep- 
tidyl IV domain, and a peptidase_S9 domain 
(fig. S4E). By searching with our proven micro- 
bial DPP4 proteins in both the oral cavity and 
stool in the Human Microbiome Project (HMP- 
1-1), we found that Bacteroides-encoded DPP4 
is mainly distributed to the stool, with little 
occurrence in the oral environment (fig. $4, 
F to I). It has been reported that bacterial 
species have extensive functional variation be- 
tween strains (38). Using profiling DPP4 
enzyme-encoding genes in the genome of 1245 
strains from Bacteroides spp. obtained from the 
National Center for Biotechnology Information 
(NCBD, we found that such genes occur widely 
among the sequenced Bacteroides spp. strains 
(fig. S4J and table S2). 

Using enzyme kinetic assays, we found that 
the ratio of the apparent unimolecular rate 
constant to the Michaelis constant (4.a;/Km) of 
btDPP4 was greater than that of other micro- 
bial DPP4 isozymes and hDPP4 (Fig. 2, C and D). 
Further, btDPP4 showed the highest activity 
toward GLP-1 among all of the DPP4 pro- 
teins tested, including hDPP4 (Fig. 2E). 


Microbial DPP4 disrupts host glucose homeostasis 


To explore the role of microbial DPP4 in host 
glucose metabolism, Bacteroides DPP4 proteins 
from five species were expressed in E. coli 
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Fig. 2. Identification of DPP4-producing bacteria and microbial DPP4. 

(A) Testing of the in vitro DPP4 activity of bacteria strains isolated from human 
stools [His-Ala-pNA and Gly-Pro-p-nitroaniline (Gly-Pro-pNA) as substrates], 

n = 3 biological replicates per group. (B) Heterologous expression and 
purification of candidate DPP4s from five Bacteroides spp. (B. thetaiotaomicron, 
B. fragili, B. eggerthii, B. dorei, and B. vulgatus); previously reported DPP4 
homologous from periodontal bacteria (Porphyromonas gingivalis, P. endodon- 
talis, Tannerella forsythia, and Prevotella intermedia) and hDPP4. Protein DPP4 
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activity was tested in vitro (His-Ala-pNA as a substrate), n = 3 biological 
replicates per group. (C) Michaelis-Menten saturation curves for hDPP4 and 
nine active microbial DPP4 homologs (His-Ala-pNA as a substrate), n = 3 
biological replicates per group. (D) keat/K, of HDPP4 and nine active microbial 
DPP4 homologs calculated from the Michaelis-Menten saturation curve in 

(C), n = 3 biological replicates per group. (E) Activity of hDPP4 and nine active 
microbial DPP4 homologs against GLP-1(7-37), n = 3 biological replicates 

per group. Data are presented as means + SEM. 
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Nissle 1917 (EcN) by genomic integration 
through CRISPR-Cas9. First, compared with 
the phosphate-buffered saline (PBS)-gavaged 
group, EcN colonization had no significant 
effect on DPP4 activity or on active GLP-1 levels 
(fig. S5, A to D). Next, a single dose of EcN 
expressing different microbial DPP4: constructs 
was orally administered to HFD-fed mice (Fig. 


3A). All groups were successfully colonized 7 
days after administration (fig. S5E), and all of 
the experimental groups showed significantly 
higher fecal DPP4 activity than control mice 
colonized with EcN treatment (fig. SSF). Higher 
DPP4 activity in extraluminal intestinal tissue 
was observed only in the EcN::btDpp4, EcN:: 
bfDpp4, and EcN::bvDpp4 colonized groups 


(Fig. 3B), and there was similar intestinal host 
DPP4 levels in all of the colonized groups com- 
pared with the control EcN group (fig. S5G). 
As DPP4 activity changed, the levels of active 
GLP-1 in the extraluminal intestinal tissue and 
plasma were significantly lower in the EcN:: 
btDpp4, EcN::bfDpp4, and EcN::bvDpp4 groups 
(Fig. 3, C and D), with unchanged levels of total 
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Fig. 3. Microbial DPP4 decreases the pools of active GLP-1 and disrupts 
glucose metabolism in HFD-fed mice. (A) Experimental scheme for (B) to (G), 
n= 6 mice/group. Mice were fed a HFD for 12 weeks, divided into six groups, and 
given a single gavage of GFP-containing EcN, EcN::btDpp4, EcN::bfDpp4, EcN:: 
beDpp4, EcN::bvDpp4, or EcN::bdDpp4 for 1 week. (B) EL intestinal tissular DPP4 
activity (His-Ala-pNA as a substrate). (C) Concentrations of EL intestinal tissular 
active GLP-1 levels. (D) Plasma levels of active GLP-1. (E) Glucose-stimulated 
insulin levels. (F and G) Oral glucose tolerance test (OGTT) curve (F) and its area 
under the curve (AUC) (G). (H) Experimental scheme for (I) to (K), 7 = 6 mice/ 
group. Dpp4’~ mice were fed a HFD for 12 weeks and administered a single 
gavage of EcN or EcN::btDpp4 for 1 week. (I) EL intestinal tissular DPP4 activity 
(His-Ala-pNA as a substrate). (J) Plasma levels of active GLP-1. (K) Glucose- 
stimulated insulin levels. (L) Experimental scheme. Mice were fed a HFD for 

12 weeks, divided into three groups, and administered the same dose of 


anaerobic PBS control, Bt, or Bt-ADpp4 by gavage for short-term or long-term 
studies. For the short-term experiments in (M) to (0), n = 5 mice/group were 
given a single gavage and lasted for 1 week; for the long-term experiments in 
(P) to (R), n = 6 mice/group were gavaged once per week and lasted for 6 
weeks. (M) EL intestinal tissular DPP4 activity (His-Ala-pNA as a substrate). 
(N) Plasma levels of active GLP-1. (0) Glucose-stimulated insulin levels. 

(P) Plasma levels of active GLP-1. (Q and R) OGTT curve (Q) and its AUC (R). 
Data are presented as means + SEM. For (B) to (G), one-way ANOVA with 
Tukey's post hoc test: *P < 0.05, **P < 0.01, ***P < 0.001 compared with the 
EcN. For (I) to (K), two-tailed Student's t test: *P < 0.05, ***P < 0.001 compared 
with the Dpp4’’~ + EcN group. For (M), (N) to (P), (R) and (Q), one-way ANOVA 
with Dunnett's T3 test (M) or Tukey's post hoc test [(N) to (P) and (R)] and 
Kruskal-Wallis test (Q): *P < 0.05, **P < 0.01, ***P < 0.001 compared with the 
control group, ##P < 0.01, ###P < 0.001 compared with the Bt group. 


GLP-1 (fig. 85, H and I). Moreover, the EcN:: 
btDpp4, EcN::bfDpp4, and EcN::bvDpp4 groups 
had lower glucose-induced insulin levels and 
poorer glucose tolerance than control EcN- 
gavaged mice (Fig. 3, E to G). Colonization ex- 
periments gavaging EcN-btDPP4 into Dpp4! ~ 
mice were used to exclude the possibility that 
host DPP4 was decreasing active GLP-1 (Fig. 
3H and fig. S5J). As expected, similar effects of 
EcN::btDpp4 were observed in Dpp4/ ~ mice 
compared with wild-type mice (Fig. 3, I to K, 
and fig. S5, K to P), but host DPP4 in the intes- 
tine was not detectable in either group. 

To explore the role of btDPP4 in the effect of 
B. thetaiotaomicron on glucose homeostasis, 
we constructed a Dpp4-deficient strain of 
B. thetaiotaomicron JSO71 (Bt-ADpp4) (fig. S6A) 
and used it to colonize wild-type SPF-housed 
mice on a HFD and compare with wild-type 
B. thetaiotaomicron (Fig. 3L). Bt-ADpp4 showed 
no DPP4 activity toward GLP-1, but also showed 
no growth deficit compared with wild-type 
B. thetaiotaomicron JSO71 (Bt) (fig. S6, B to D). 
One week after the colonization of HFD-fed 
mice with Bt (fig. S6, E and F), we found that 
DPP4 activities in the feces and extraluminal 
intestinal tissue were both higher than in the 
control group given anaerobic PBS (Fig. 3M 
and fig. S6G), whereas the host intestinal DPP4: 
concentrations were identical among the dif- 
ferent treatment groups (fig. S6H). Active 
GLP-1 concentrations in extraluminal intestinal 
tissue and plasma in the Bt group were lower 
than those in the control (Fig. 3N and fig. S6D, 
although total GLP-1 levels were unchanged 
(fig. S6, J and K). After wild-type Bt treatment, 
mice exhibited lower glucose-stimulated insu- 
lin levels and poorer glucose tolerance than 
the anaerobic PBS-treated control mice (Fig. 30 
and fig. S6, L and M). Colonization with Bé- 
ADpp4 blunted these various effects of Bt on 
DPP4 activity, active GLP-1 levels, and glucose 
homeostasis (Fig. 3, M to O, and fig. S6, F to 
M). Long-term colonization of Bt for 6 weeks 
(Fig. 3L) resulted in similar effects as 1 week 
of colonization, and body weight and food 
intake remained unchanged between the two 
groups (Fig. 3, P to R, and fig. S6, N to W). 


Wang et al., Science 381, eadd5787 (2023) 


To eliminate the effect of endogenous bacteria 
on Bt-induced glucose disruption, we colonized 
GF mice treated with 1% DSS with wild-type 
Bt or Bt-ADpp4 (fig. S7, A to C). Like the effects 
in SPF mice, gavage with either wild-type Bt or 
purified btDPP4 delivered by L100-55 resulted 
in higher DPP4 activity, lower active GLP-1 levels, 
and impaired glucose tolerance compared with 
controls (fig. S7, D to M); these effects were not 
apparent in the Bt-ADpp4 colonization group 
(fig. S7, D to M). 

Because the above sequence analysis showed 
that btDPP4 contained an N-terminal signal 
peptide (fig. S4E), we next investigated whether 
Bt impaired host glucose homeostasis through 
the secretion of btDPP4. We observed DPP4: 
activity in the B. thetatotaomicron JSO71 cul- 
ture supernatant (fig. S8, A and B). Further- 
more, a btDPP4 signal peptide knockout strain, 
Bt-ASP-Dpp4, was established (fig. S8, C and D). 
As expected, no DPP4 activity was detected 
in the culture supernatant of Bt-ASP-Dpp4, 
whereas the bacterial pellets showed higher 
DPP4 activity in the Bt-ASP-Dpp4 strain com- 
pared with wild-type Bt (fig. SSE). We next 
gavaged HFD-fed mice with either Bt or Bt- 
ASP-Dpp4, leading to similar levels of abun- 
dance after gavage (fig. S8, F and G). Only 
the Bt group showed a significant increase in 
DPP4 activity in extraluminal intestinal tissue 
and a decrease of active GLP-1 (fig. S8, H to J) 
compared with control. 


Clinical DPP4 inhibitors do not inhibit 
microbial DPP4 


We next investigated whether this interking- 
dom pathway is susceptible to inhibitors that 
target hDPP4. We found that the clinical DPP4 
inhibitor sitagliptin (39) was ~78-fold less active 
against btDPP4 than hDPP4 (Fig. 4A and fig. 
SQA). All of the clinically relevant DPP4 inhi- 
bitors that we tested showed weak inhibition of 
DPP4 activity and did not affect the growth of 
B. thetaiotaomicron (fig. S9). 

We then solved the apo structure of btDPP4: 
to a resolution of 1.92 A (Fig. 4B and table S3). 
There are two dimers in one crystallographic 
unit cell. Each monomer comprises two struc- 
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tural domains: an N-terminal, eight-bladed 
6-propeller with open “Velcro” topology and a 
C-terminal o/B-hydrolase catalytic domain 
(Fig. 4B). The dimeric interface of the btDPP4 
structure is predominantly mediated by hydro- 
phobic interactions of the catalytic domain and 
two protruding antiparallel B-sheets from pro- 
peller blade IV (Fig. 4B). With an amino acid 
sequence identity of 32% to hDPP4, the overall 
architecture of btDPP4 could be superimposed 
on hDPP4 [Protein Data Bank (PDB) ID 1R9N] 
with a root-mean-square deviation (RMSD) of 
14 A over 611 Ca atoms (fig. S10A, left). The 
structure of btDPP4 resembles previously solved 
structures of pgDPP4 (PDB ID 50LJ) from 
Porphyromonas gingivalis (19) (RMSD = 0.9 A), 
smDPP4 (PDB ID 2ECF) from Stenotrophomonas 
maltophilia (40) (RMSD = 1.8 A), and pmDPP4 
(PDB ID 5YP1) from Pseudoxanthomonas 
mexicana WO24 (20) (RMSD = 1.6 A) (fig. 
S10B). Structural comparison of the catalytic 
domains between btDPP4 and hDPP4 indi- 
cated that the conserved S606, D681, and H713 
residues in btDPP4 act as the catalytic triad 
(fig. S10A, right). A hydrophobic pocket near 
Y638 may serve as the S1 pocket, and the S2 
subsite is formed by a hairpin R-loop contain- 
ing R113, an E201-E202 motif, Y524, and Y642 
(fig. S10A, right). Subsequent site-directed 
mutagenesis of btDPP4 followed by enzymatic 
assays showed that most of the above residues 
(except for R113) resulted in a loss of activity 
(fig. S10C), thus verifying their roles in btDPP4 
function. 

To elucidate the molecular basis for the weaker 
inhibitory effect of sitagliptin on btDPP4 rela- 
tive to hDPP4, we co-crystalized btDPP4 with 
sitagliptin and solved the complex structure to 
a resolution of 1.97 A (Fig. 4C, fig. SIOD, and 
table $3). Both a 2mF,-DF, map and an omit 
map indicated the binding of sitagliptin at this 
site (Fig. 4C and fig. S10D). Structural compa- 
rison with the sitagliptin-bound hDPP4 (PDB 
ID 1X70) (41) revealed common and distinct 
recognition features between btDPP4 and hDPP4: 
(Fig. 4D). The 2,4,5-trifluorophenyl moiety and 
the oxobutan-amine of sitagliptin (fig. SIOE) 
adopted nearly identical conformations and 
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Fig. 4. The clinically relevant DPP4 inhibitor 
sitagliptin exhibits a weaker effect on btDPP4 
than on hDPP4. (A) In vitro inhibition curve of 
sitagliptin against either hDPP4 or btDPP4. (B) The 
overall dimer structure of btDPP4 shown in a 
cartoon representation. One protomer is shown in 
gold and one in pale green. The dimeric interface is 
shown in orange and deep green. (C) 2mF,-DF, 
map (blue mesh, o = 1.5) of btDPP4-bound 
sitagliptin. The binding pocket is shown as a gray 
surface, and the side chains of some key residues 
are shown as sticks. (D) Active-site super- 
imposition between sitagliptin-bound btDPP4 (pale 
green) and sitagliptin-bound hDPP4 (gray, PDB: 
1X70), with key residues labeled as one-letter 
codes. Both inhibitors are shown as ball-and-stick 
figures. The trifluoromethyl group of the bound 
sitagliptin in btDPP4 flipped by ~40° to avoid steric 
clash with btDPP4-E342 compared with that in 
hDPP4. The same color schemes were applied to 
all figures. F, phenylalanine; S, serine; R, arginine; 
E, glutamic acid; Y, tyrosine. (E) Schematic 
diagram of the clinical trial design for evaluating 
the effects of microbial DPP4 on the clinical 
efficacy of sitagliptin. (F) The change of HbAic 
over 3 months of sitagliptin intervention between 
the SHR group (n = 34) and the SLR group (n = 
23). (G) Fecal DPP4 activity of the SHR and SLR 
groups (His-Ala-pNA as a substrate). 

(H) Volcano plot of metagenomic sequencing data 
of the SHR and SLR groups. The graph depicts the 
average log2 ratio of abundances between both 
groups for each individual species and the 
corresponding adjusted P value. The adjusted 

P value threshold was calculated using moderated 
Student's t test followed by Benjamini-Hochberg 
multiple test correction false discovery rate (FDR). 
(I) Taxonomic cladogram generated from linear 
discriminant analysis (LDA) effect size of meta- 
genomic sequencing data. Blue indicates enriched 
taxa in the SHR group; pink represents enriched 
taxa in the SLR group. (J) Relative levels of 
microbial DPP4 mRNA in human feces as detected 
by RT-qPCR. (K) Correlation analysis between 
microbial DPP4 mRNA levels and fecal DPP4 
activity, the change (before treatment minus after 
treatment) of fasting blood glucose, HbA;,, fasting 
insulin, and fasting C-peptide. Spearman's rank 
test: *Q < 0.05, **Q < 0.01, Q value (FDR-adjusted 
P value). Data are presented as means + SEM. For 
(G), two-tailed Students’ t test and for (J), Mann- 
Whitney U test: **P < 0.01, ***P < 0.001. 


conservatively interacted with E201, E202, Y524, 
S606, and Y642 in btDPP4, which correspond to 
E205, E206, Y547, S630, and Y666 in hDPP4 
(Fig. 4D). In the hDPP4-sitagliptin structure, 
the trifluoromethyl triazolopyrazinyl (TMTP) of 
sitagliptin forms additional interactions, includ- 
ing n-n stacking with F357 and hydrogen bonds 
with S209 and R358. However, in the btDPP4- 
sitagliptin structure, this TMTP moiety under- 
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goes ~40° of flipping to avoid steric hindrance 
with E342, and thus disallows the formation of 
stable interactions to the small helix linking 
blade V and blade VI (the V/VI linker motif) 
(Fig. 4D). Sequence alignment revealed that 
the TMTP-stabilizing residues (F357, S209, and 
R358) in hDPP4 are not conserved in btDPP4, 
whereas the protruding btDPP4-E342 residue 
that occludes the trifluoromethyl] group is miss- 
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ing in hDPP4 (fig. SIOF). Therefore, the weak 
TMTP contact observed in the btDPP4-sitagliptin 
complex structure likely accounts for the 
decreased inhibitory effect of sitagliptin on 
btDPP4 (Fig. 4A), and the variable V/VI linker 
motif provides a structural basis for explaining 
the distinct inhibitory efficiencies of gliptins 
against different DPP4 homologs. Thus, inhibi- 
tion of hDPP4 alone may not be able to fully 
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restore the concentration of active GLP-1, and 
it seems necessary to inhibit both human and 
microbial DPP4 to improve glucose homeostasis 
more effectively. 


Microbial DPP4 limits the efficacy of sitagliptin 


Because of the weaker inhibitory effects of cli- 
nical DPP4 inhibitors toward microbial DPP4, 
we next tested the effect of gut microbial DPP4 
on the therapeutic effectiveness of clinical DPP4 
inhibitors. Serum and stool samples were col- 
lected from 57 patients with newly diagnosed 
T2D before and after sitagliptin treatment for 
3 months (fig. S11A) (42), and changes in gly- 
cated hemoglobin (HbA,;,) were monitored to 
assess the therapeutic effect of sitagliptin (Fig. 
4, and table S4). After sitagliptin intervention, 
high interpersonal variability in glucose homeo- 
stasis assessment of HbA,, was observed, indi- 
cating heterogeneous responses to sitagliptin 
among the cohort. According to the diagnostic 
criteria for T2D (43) and previous sitagliptin 
clinical trials (44-47), we defined the patients 
whose HbA,, decreased to <6.5% or decreased 
by >1% during sitagliptin treatment as sitagliptin 
high-responders (SHRs, 7 = 34); the other pa- 
tients were defined as sitagliptin low-responders 
(SLRs, 2 = 23) (Fig. 4F). There was no significant 
difference in the baseline of physiological indi- 
cators, including glucose metabolism, between 
SHRs and SLRs before sitagliptin treatment (fig. 
SIIB). SHRs exhibited greater improvement in 
fasting blood glucose than did SLRs, with equal 
levels in fasting insulin and fasting C-peptide 
(fig. S11, C to E). The baseline DPP4 activity in 
stool samples of SLRs was significantly higher 
than that in SHRs (Fig. 4G). 

Further exploring the gut microbiota of the 
two categories of patients (table S5), we did not 
observe any overall difference in the diversity 
indices between the two groups (fig. S11, F and 
G); however, we did observe an enrichment of 
B. thetaiotaomicron among the SLRs (Fig. 4H). 
Linear discriminant analysis effect size analy- 
sis also showed the Bacteroides genus and 
B. thetaiotaomicron as distinguishable markers 
between the SHRs and SLRs groups (Fig. 41), 
mirroring the DPP4 activity analysis performed 
in the bacterial isolates (Fig. 2A). Both gene reads 
and mRNA levels of btDPP4 and bvDPP4 were 
enriched in the SLR group (Fig. 4J and fig. S11H). 
btDPP4 (both gene reads and mRNA levels) 
negatively correlated with the improvements 
of fasting blood glucose and HbA,, and showed 
a positive correlation with fecal DPP4: activity 
(Fig. 4K and fig. S11D, indicating that the 
abundance of btDPP4 and bvDPP4 may reflect 
the fecal DPP4: activity in individuals with T2D. 

To investigate the potential effect of the gut 
microbiota (especially B. thetaiotaomicron and 
btDPP4) on the clinical efficacy of sitagliptin, 
we performed FMT experiments transferring 
human stool into HFD-fed mice pretreated with 
antibiotic for 1 day and then with sitagliptin for 
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6 weeks (fig. S12A). We obtained a bacteriophage 
strain (BéP) that selectively kills B. thetaiotaomicron 
(48, 49) (fig. S12, B to E. As a control, we used T7 
phage (50), which targets E. coli, an abundant, 
non-DPP4-producing microbe in our cohort 
(fig. S12F). As expected, mice transplanted with 
the gut microbiota from SLRs had a higher 
abundance of B. thetaiotaomicron (fig. S12E), 
as well as elevated DPP4 activity (fig. S12, 
Gand H) and no change in the intestinal host 
DPP4 content (fig. S121), and the mice showed 
lower active GLP-1 concentrations and impaired 
glucose tolerance (fig. S12, J to M). T7 phage 
treatment substantially reduced the abundance 
of E. coli in the feces (fig. S12F); however, the 
glucose homeostasis was not altered (fig. S12, 
G to M). By contrast, in the SLR + BéP group, 
clearance of B. thetaiotaomicron (fig. S12E) 
reversed the negative effects induced by SLR 
FMT on sitagliptin treatment (fig. S12, G to M). 


Dau-d4 is a specific microbial DPP4 inhibitor 


Our results indicated the potential of thera- 
peutically targeting microbial DPP4 to treat T2D. 
Therefore, to identify inhibitors of microbial 
DPP4, we developed a high-throughput drug- 
screening system using the btDPP4 protein and 
tested the inhibitory effects of a small-molecule 
library containing ~107,000 compounds (Fig. 5A). 
At a concentration of 10 uM, 10 compounds 
inhibited btDPP4 activity by >90% (hit rate 
0.01%; Fig. 5B); the low hit rate indicated that 
the screen was selective. We measured the 
median inhibitory concentration (IC;9) values 
(fig. S13) and among the 10 hits, Dau, an alka- 
loid natural product found in a variety of medi- 
cinal herbs, including Menispermum dauricum 
(51), showed the strongest inhibitory activity 
against btDPP4 (ICs = 0.37 uM). It also had 
high selectivity because almost no inhibition of 
hDPP4 was observed (fig. S13B). 

A series of Dau derivatives were obtained 
for preliminary structure-activity relationship 
studies (fig. S14, A and B). We found that Dau-d4 
(Fig. 5C and fig. S14C), synthesized by methoxyl- 
ation of the hydroxyl groups, showed improved 
btDPP4 inhibitory activity (ICs) = 88 nM) over 
the native molecule and no hDPP4 inhibitory 
activity (Fig. 5D). Using isothermal titration 
calorimetry, we found that Dau-d4 bound to 
btDPP4 at a Kp of 0.27 uM (Fig. 5E), whereas 
sitagliptin bound btDPP4 at a Kp of 12.6 uM 
(fig. S14D). 

Dau-d4 showed strong inhibitory effects 
against bfDPP4, beDPP4, bvDPP4, and bdDPP4: 
(Fig. 5F). It also inhibited DPP4 activity in 
Bacteroides spp. at the strain level (fig. S14E) 
and in ex vivo communities obtained from 
healthy volunteers (fig. S14F). By contrast, sitag- 
liptin had little effect against Bacteroides spp. 
at the protein, strain, and ex vivo levels (fig. 
$14, E to G). 

Next, we determined the co-crystallized struc- 
ture of the btDPP4-Dau-d4 complex to a reso- 
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lution of 2.74 A (Fig. 5G, fig. SI5A, and table $3). 
Because the tetrahydroisoquinoline moieties on 
both sides of Dau-d4 (fig. SI5B) are too large to 
be coordinated into the S1 pocket (Fig. 5G), the 
binding sites for Dau-d4 to btDPP4 must be dis- 
tinct from that of sitagliptin (fig. S15C). Specifi- 
cally, the tetrahydroisoquinoline A moiety and 
the phenyl A ring form two z-cation interactions 
with the side chains of S2 subsite residues R113 
and Y524 (Fig. 5H). The other half of the tetra- 
hydroisoquinoline and phenyl moiety forms 
hydrophobic interactions with W605 and Y716 
and hydrophilic interactions with R721, com- 
pletely occluding the binding of substrate p1’ and 
p2' residues at the S1’' and S2' subsites (Fig. 5H). 
The importance of these residues was confirmed 
by mutations to Ala that showed marked reduced 
binding affinities for Dau-d4 (Fig. 51). 

Because Dau-d4 is more selective for btDPP4 
than for hDPP4, we superimposed the btDPP4- 
Dau-d4 complex structure with hDPP4 and 
found that several residues surrounding Dau-d4 
in btDPP4 were not conserved in hDPP4 (fig. 
S15D). Among them, P110/Y716/R721 from 
bunch 1, Y522/Q531 from bunch 2, and the 
V/VI linker motif were the major contributors 
to the specificity of Dau-d4 for btDPP4 (fig. 
S15D). Individual mutations of these sites to 
the corresponding residues in hDPP4 attenu- 
ated Dau-d4 binding affinity to varied extents 
(fig. SI5E). By contrast, we were also able to 
engineer hDPP4 to acquire the capacity for 
Dau-d4 binding by introducing the single 
mutation of either hDPP4-K554 to glutamine 
(K554Q) or hDPP4-F357 to glutamic acid (F357E) 
(fig. SI5F). This conversion of hDPP4 giving it 
the ability to effectively bind Dau-d4 was even 
more pronounced when a chimeric hDPP4 
protein containing the btDPP4-V/VI linker mo- 
tif replacement (hDPP4-V354~P362—>btDPP4- 
1340~1348) was tested (fig. SI5F). These three 
hDPP4 mutants that could effectively bind 
Dau-d4 retained their activity toward GLP-1(7-37) 
(fig. S15G). Similar to the binding assay results 
(fig. S15F), Dau-d4 showed strong inhibition 
of these three mutants of hDPP4 (fig. S15H). 
These results show that different residue com- 
positions around the active sites, especially the 
V/VI linker motif, are the primary elements 
that determine the specificity of inhibition of 
btDPP4 by Dau-d4 versus its targeting of hDPP4. 


Dau-d4 improves glucose homeostasis 


We next performed Dau-d4 treatment of SPF- 
raised mice on a HFD colonized with an an- 
aerobic PBS control, Bt, or Bt-ADpp4 to further 
test whether Dau-d4 promotes glucose ho- 
meostasis through a btDPP4-dependent path- 
way (Fig. 6A). After successful colonization by 
Bt and Bt-ADpp4 (fig S16, A and B), we found 
that Dau-d4 decreased DPP4: activity in the 
feces and extraluminal intestinal tissue (Fig. 
6B and fig. S16C) without changing the intes- 
tinal host DPP4 content (fig. S16D). Dau-d4 
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the high-throughput screening and evaluation of microbial DPP4 inhibitors. 
(B) Distribution of the inhibition rates against btDPP4 of the individual 
compounds from a 107,000 small-molecule chemical library. (©) Chemical 
structure of Dau-d4. (D) Inhibition curve of Dau-d4 against btDPP4 or hDPP4, 
n = 3 biological replicates. (E) Isothermal titration calorimetry (ITC)-mediated 
determination of btDPP4 and hDPP4 binding to Dau-d4. (F) Inhibition curve of 
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o =1.5) of btDPP4-bound Dau-d4 structure with btDPP4 shown as a gray surface. 
(H) Dau-d4 binding competes with the substrate binding at the active site. 
Substrate p2, pl, pl’, p2’, and p3' residues (Y, tyrosine; P, proline; S, serine; K, 
lysine) are indicated and shown in salmon (PDB code: IRON). (I) Kp values of 
Dau-d4 binding against btDPP4 among different loss-of-function mutations, n = 3 
biological replicates. WT, wild-type btDPP4; W, tryptophan; R, arginine; A, alanine. 


9 of 13 


RESEARCH | RESEARCH ARTICLE 


treated mice increased the pools of active 
GLP-1 (Fig. 6C and fig. S16, E to G), and this 
was associated with higher glucose-stimulated 
insulin levels and improved responses on an 
oral glucose tolerance test (Fig. 6D and fig. 
$16, H and I). Bt colonization blunted the 
effects of Dau-d4, whereas Bt-ADpp4 did not 
(Fig. 6, B to D, and fig. S16, B to I). Dau-d4 
remained effective in Dpps! ~ mice in terms of 
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the beneficial effects on glucose homeostasis 
(Fig. 6, Eto H, and fig. S16, J to M) because it 
targets microbial-derived DPP4. Similar ef- 
fects of Dau-d4 were observed in both DSS 
and indomethacin mouse models of leaky gut 
(fig. S17, A to R). 

We next assessed the chronic effects of Dau-d4. 
Mice fed a HFD for 12 weeks were subsequently 
treated with Dau-d4 for an additional 6 weeks 


(Fig. 61). Dau-d4 did not significantly influence 
body weight or food intake during the 6-week 
treatment period compared with PBS treatment 
(fig. S18, A and B). However, Dau-d4 treatment 
resulted in significantly lower DPP4 activity in 
the feces and extraluminal intestinal tissue and 
greater intestinal and plasma active GLP-1 levels, 
and Dau-d4 treatment improved glucose toler- 


ance compared with PBS control (Fig. 6, J to L 
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Fig. 6. Dau-d4 improves host glucose metabolism. (A) Experimental scheme 
for (B) to (D), n = 6 mice/group. Mice were fed a HFD for 12 weeks, divided into 
five groups, and treated with anaerobic PBS control, Bt single colonization 

(Bt group), Dau-d4 (10 mg/kg) gavage (Dau-d4 group), Dau-d4 (10 mg/kg) 
combined with Bt single colonization (Dau-d4 + Bt group), or Dau-d4 (10 mg/kg) 
combined with Bt-ADpp4 colonization (Dau-d4 + Bt-ADpp4 group), respectively, 
for 1 week. (B) EL intestinal tissular DPP4 activity (His-Ala-pNA as a substrate). 
(C) Plasma levels of active GLP-1. (D) Glucose-stimulated insulin levels. 

(E) Experimental scheme for (F) to (H), n = 6 mice/group. Dpp4-”" mice were 
fed a HFD for 12 weeks, divided into two groups, and administered a PBS control 
containing 1% DMSO or Dau-d4 (10 mg/kg) for 1 week, n = 6 mice/group. 

(F) EL intestinal tissular DPP4 activity (His-Ala-pNA as a substrate). (G) Plasma 
levels of active GLP-1. (H) Glucose-stimulated insulin levels. (1) Experimental 
scheme for (J) to (L), n = 6 mice/group. Mice were fed a HFD for 12 weeks, 
divided into two groups, and then administered a PBS control containing 1% 
dimethylsulfoxide (DMSO) or Dau-d4 (10 mg/kg) for 6 weeks. (J) Plasma levels 


of active GLP-1. (K and L) OGTT curve (K) and its AUC (L). (M) Experimental 
scheme for (N) to (S), n = 6 mice/group. The ob/ob mice were divided into 
four groups and treated with 10 mg/kg Dau-d4, 10 mg/kg sitagliptin, or 10 mg/kg 
Dau-d4 and 10 mg/kg sitagliptin or the same volume of PBS control containing 
1% DMSO for 6 weeks. (N) Fecal DPP4 activity. (O) EL intestinal tissular DPP4 
activity (His-Ala-pNA as a substrate). (P) Concentrations of EL intestinal tissular 
active GLP-1 levels. (Q) Plasma levels of active GLP-1. (R and S) OGTT curve (R) 
and its AUC (S). Data are presented as means + SEM. For (B) to (D), one-way 
ANOVA with Tukey's post hoc test: *P < 0.05, **P < 0.01, ***P < 0.001 compared 
with the control; #P < 0.05, ##P < 0.01, ###P < 0.001 compared with the 
Dau-d4 group; and $P < 0.05, $$P < 0.01, $$$P < 0.001 compared with the 
Dau-d4 + Bt group. For (F) to (L), two-tailed Student's t test: *P < 0.05, **P < 0.01, 
***P < 0.001. For (N) to (S), one-way ANOVA with Tukey's post hoc test [(N), 
(P), and (Q)] or Dunnett's T3 test [(O), (R), and (S)]: *P < 0.05, **P < 0.01, 
***P < 0.001 compared with the control, #P < 0.05, ##P < 0.01, ###P < 0.001 
compared with the Dau-d4 group. 


and fig. S18, C to F). Dau-d4 could significantly 
improve glucose homeostasis of ob/ob mice, a 
genetic model of obesity, without altering body 
weight and food intake (fig. S18, G and H). The 
effect in ob/ob mice was associated with an 
inhibition of the decrease of active GLP-1 (Fig. 6, 
MtoS, and fig. $18, G to I), and the combination 
of Dau-d4 and sitagliptin in ob/ob mice further 
improved glucose tolerance without altering 
body weight, food intake, or intestinal levels of 
host DPP4 (Fig. 6, M to S, and fig. $18, G to I). 
Furthermore, Dau-d4 improved the respon- 
siveness of mice treated with sitagliptin that 
had received FMT with gut microbiota from 
SLRs (fig. S19). 

Dau-d4 had no significant inhibitory effects 
on the growth of several gut bacteria (fig. 
$20A) and had no toxic effects on several host 
cell lines (fig. S20B). To explore the in vivo 
safety of Dau-d4, we administered Dau-d4 to 
SPF mice by gavage with the drug at an effec- 
tive dose (10 mg/kg) and a high dose (100 mg/kg) 
for 6 weeks, and found that neither dose resulted 
in any notable adverse effects, including loss of 
body weight, changes in food intake, or changes 
in liver and kidney function (alanine amino- 
transferase, aspartate transaminase, urea nitro- 
gen, and creatinine; fig. S20, C to H). To further 
explore the stability of Dau-d4 in the native gut 
microbiota setting, we found Dau-d4 levels 
remain unchanged for 4 days in ex vivo fecal 
communities anaerobically (fig. S201). We then 
gave Dau-d4 orally to both SPF and GF mice and 
measured fecal Dau-d4 concentrations at differ- 
ent times. Although the fecal Dau-d4 concentra- 
tion curve of GF mice had an overall backward 
shift, perhaps because of the known slower 
gastrointestinal motility and gastric emptying 
of GF mice (52), the area under the curve for 
Dau-d4 concentration was not significantly 
different between the two groups (fig. S20, J 
and K). Pharmacokinetics analysis showed that 
Dau-d4 had a bioavailability of 6.43% (fig. S20L), 
indicating that Dau-d4 has poor oral absorption 
and may be a gut-restricted inhibitor. Together, 
these results suggest that Dau-d4 may be a safe 
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and effective treatment for T2D, depending on 
the gut microbiota profile of the patients. 

To engineer a more ideal gut-restricted inhib- 
itor with better potential safety and tissue 
targeting, we synthesized a Dau-d4 derivative, 
Dau-d23 (fig. S21A), which also has potential 
inhibitory activity against btDPP4 (fig. S21B). 
Using pharmacokinetics analysis, we found that 
Dau-d23 is a more gut-restricted compound 
than Dau-d4 (fig. S21C), with a bioavailability 
of 3.61%. Moreover, Dau-d23 remained effec- 
tive in HFD-fed mice in terms of the beneficial 
effects on glucose homeostasis (fig. $21, D to K). 


Discussion 


The enzymes produced by the gut microbiota 
that perform similar functions to host enzymes 
may play a role in modulating host health and 
diseases (13, 53, 54), but they remain largely 
unexplored. By developing an experimental 
workflow, we explored the role of microbial 
enzymes in the disruption of host metabolic 
homeostasis and clinical treatment options. 
Although our ex vivo cultures produced several 
microbial-host isozymes, because of the limita- 
tions of enzyme assays, some of the substrates 
that we used were fluorogenic or other mimics, 
which cannot fully represent the activity of 
human enzymes. Also, we should note that our 
enzyme activity workflow can only measure 
the overall activity of each isozyme at a single 
substrate concentration, and the source of the 
isozyme, its gene, and its kinetic analysis deserve 
further attention. Moreover, our workflow was 
mainly based on stool samples; however, it has 
been reported that fecal microbiota compo- 
sition does not accurately reflect the intestinal 
environments such as the colon (55, 56). There- 
fore, our microbial-host isozyme system only 
simulates the activity of fecal microbiota, and 
the activity of different enzymes in diverse 
intestinal segments still needs to be further 
evaluated. 

Our microbial-host isozyme screening plat- 
form identified DPP4 as a major microbial-host 


isozyme that targets GLP-1, a potent incretin 
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involved in glucose homeostasis. Previous stud- 
ies revealed the role of gut microbiota in the 
regulation of host GLP-1 by other mechanisms 
(57, 58), which laid the foundation for studying 
the microbial factors regulating GLP-1. It has 
been speculated that the gut microbiota may 
also include community members with DPP4- 
like activity (27). Our question was, if GLP-1 is 
mainly produced in the extraluminal intestinal 
tissue whereas microbial DPP4 is produced by 
gut microbiota in the gut lumen, then how does 
the microbial DPP4 regulate the bioavailability 
of GLP-1 in the host? 

Gut microbial proteins can cross the intes- 
tinal barrier if its integrity has been compro- 
mised and enter the bloodstream (59). However, 
apart from promoting gut leakiness, a HFD can 
have highly pleiotropic effects on the host, 
including elevated bile acid concentrations, 
overgrowth of proteobacteria, stimulation of 
subclinical inflammatory responses, and altered 
circadian behavior. We used low-dose DSS and 
indomethacin as alternative models for intesti- 
nal barrier damage and observed similar effects 
as for HFD on elevation of microbial DPP4, 
decrease of extraluminal intestinal and plasma 
levels of active GLP-1, and metabolic readouts. 
However, there are also other possible explana- 
tions apart from our speculation. It cannot be 
completely excluded that microbial DPP4 regu- 
lates a microbiota-derived signal that is de- 
tected on the apical border of L cells and alters 
GLP-1 secretion. Unchanged total GLP-1 levels 
in plasma are also determined by renal clear- 
ance of the inactivated peptide rather than cleav- 
age rate. Although all of the treatments tested 
(HED, DSS, and indomethacin) increase intes- 
tinal permeability, they also cause inflammatory 
responses in the gut to varying degrees, which may 
have indirect effects on the GLP-1 secretion rate. 

By showing the benefit of pharmacological 
inhibition of microbial DPP4, our results high- 
light the potential value of targeting this micro- 
bial enzyme in the clinical treatment of patients 
with T2D who show poor responses to currently 
used DPP4 inhibitors. Such a “drugs for bugs” 
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approach in the management of this common 
chronic disease could be expanded to other 
similar human diseases. 


Materials and methods summary 


Detailed materials and methods can be found 
in the supplementary materials, including me- 
thods for the collection of human samples, 
mouse treatments, gut bacteria isolation and 
culture, microbial-host isozyme assays, DPP4 
activity measurement, growth curves of related 
bacterial stains, phylogenetic analyses, reverse 
transcription quantitative polymerase chain 
reaction (RT-qPCR), 16S rRNA gene amplicon 
sequencing and analysis, metagenomic sequenc- 
ing and analysis, the construction of a microbial 
DPP4-expressing F. coli strain, the deficiency of 
DPP4 and SP-DPP4 in B. thetatotaomicron, mea- 
surement of mouse metabolic phenotype and 
intestinal permeability, purification of micro- 
bial DPP4, crystallization and structure deter- 
mination of btDPP4, prevalence analysis 
microbial DPP4, bacteriophage isolation and 
host range assessment, high-through screening 
for btDPP4 inhibitors, synthesis of Dau-d4 and 
Dau-d23, and isothermal titration calorime- 
try assay. 
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INTRODUCTION: Genetic analysis of historical 
individuals has the potential to help restore 
knowledge of people whose stories were omitted 
from written records. During the late 18th and 
early 19th centuries, Catoctin Furnace in Mary- 
land relied on a workforce of enslaved individu- 
als to operate the iron furnace and carry out 
domestic and agricultural tasks. Despite the 
role that Catoctin Furnace played in early US 
history (including supplying munitions during 


or their descendants compared with the fur- 
nace’s later, predominantly white workforce. 


RATIONALE: We produced genome-wide data 
for 27 individuals buried in the Catoctin Furnace 
African American Cemetery and compared them 
to ~9.3 million consenting research partici- 
pants genotyped by 23andMe, Inc., to address 
the following questions: (i) How were the 
Catoctin individuals related to each other? (ii) 


What were the sources of their African and 
European ancestry? (iii) Where in the US do 


the Revolutionary War), relatively little is known 
about the African Americans who labored there 
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Genetic connections to individuals from Catoctin Furnace African American Cemetery in Maryland. 

A timeline highlighting major events in the history of Catoctin Furnace and the birth years of research 
participants in the 23andMe cohort, presented alongside maps showing the proportion of 23andMe research 
participants associated with geographic coordinates in the US, sub-Saharan Africa, and Europe who share 
genetic connections to the Catoctin individuals. 
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direct descendants? (iv) What can their genc__-- 
reveal about their health? 


RESULTS: We identified five genetic families, 
consisting of biological mothers, children, and 
siblings, among the Catoctin individuals. In 
most cases, biological family members were 
buried in close proximity. 

All but one of the Catoctin individuals had 
primarily African ancestry, with variable amounts 
of European ancestry. To learn more about 
their ancestry, we developed an approach to 
detect identical-by-descent segments of the 
genome shared between the Catoctin indivi- 
duals and 23andMe research participants. 
Identical-by-descent segments of DNA are 
shared by two or more people because they 
have been inherited from a recent common 
ancestor. We identified 41,799 close and dis- 
tant relatives of the Catoctin individuals among 
23andMe research participants. Within Africa, 
we found the highest rates of genetic sharing 
between Catoctin individuals and research par- 
ticipants who self-identified as belonging to the 
Wolof or Kongo ethnolinguistic groups. Within 
Europe, we observed the highest rates of genetic 
sharing with research participants that have ties 
to Great Britain and Ireland. 

Within the US, participants from the South 
showed elevated rates of sharing, largely reflect- 
ing distant connections to 23andMe research 
participants with sub-Saharan African ancestry 
(possibly tracing back to shared common ances- 
tors in Africa). When we considered genetic 
relatives who share the most identical DNA with 
the Catoctin individuals, we observed the highest 
rates of sharing in Maryland, suggesting that 
at least some descendants stayed in the region 
after the furnace’s transition away from enslaved 
and paid African American labor. 

Finally, we found that some of the Catoctin 
individuals carried risk factors for sickle cell 
anemia and glucose-6-phosphate dehydrogenase 
deficiency, genetic diseases that are common 
in African Americans today. 


CONCLUSION: These results demonstrate the 
power of joint analysis of DNA from historical 
individuals and the extremely large datasets 
generated through direct-to-consumer ancestry 
testing, and they serve as a model for obtaining 
direct insights into the genome-wide genetic 
ancestry of enslaved people in the historical US. 
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Few African Americans have been able to trace family lineages back to ancestors who died before the 1870 
United States Census, the first in which all Black people were listed by name. We analyzed 27 individuals from 
Maryland’s Catoctin Furnace African American Cemetery (1774-1850), identifying 41,799 genetic relatives 
among consenting research participants in 23andMe, Inc.'s genetic database. One of the highest 
concentrations of close relatives is in Maryland, suggesting that descendants of the Catoctin individuals 
remain in the area. We find that many of the Catoctin individuals derived African ancestry from the Wolof or 
Kongo groups and European ancestry from Great Britain and Ireland. This study demonstrates the power 

of joint analysis of historical DNA and large datasets generated through direct-to-consumer ancestry testing. 


he vast majority of the ~45 million self- 

identified Black and/or African American 

individuals living in the United States 

descend from ~456,600 enslaved Africans 

who were forcibly transported to the US 
from Africa during the transatlantic slave trade 
between 1501 and 1867 (J, 2). However, African 
Americans often have little information about 
these ancestors or their African origins owing 
to a history of inhumane treatment of the en- 
slaved and their descendants, which included 
marginalization and the obfuscation of family 
histories (3). Here, we demonstrate that, when 
combined with genome-wide data from a suf- 
ficiently large and diverse genetic database, 
DNA from historical individuals provides a 
means for restoring knowledge of familial con- 
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nections between contemporary peoples and 
their historical relatives. Specifically, we re- 
port on the DNA of enslaved and free African 
Americans from Catoctin Furnace, Maryland, 
who lived, worked, died, and were buried 
there in the late 18th and early 19th centuries. 

As early as December 1768, a tract of land was 
acquired for the purposes of building an iron 
works at the foot of Catoctin Mountain near 
present-day Thurmont, Maryland (4). The fur- 
nace was in blast by 1776, producing pig iron, 
tools, household items, and munitions used 
during the Revolutionary War. At least 271 
enslaved and an unknown number of free 
African Americans worked at Catoctin, within 
and outside the furnace, as ore miners, col- 
liers, forgemen, fillers, teamsters, and wood- 
cutters, as well as in domestic and agrarian 
roles in the furnace owners’ households and 
plantations (5). In the second quarter of the 
19th century, the furnace’s labor force switched 
primarily to wage labor and a predominantly 
white workforce (6). Gradually, the contri- 
butions of African Americans in this early in- 
dustrial complex were largely forgotten. The 
Catoctin Furnace African American Cemetery, 
near an old ore pit, was excavated in 1979- 
1980 in advance of highway construction 
(Fig. 1A) (7-10). The Maryland State Highway 
Administration transferred stewardship of 
the recovered remains of deceased humans 
to the Smithsonian Institution, where curator 
J. Lawrence Angel conducted preliminary fo- 
rensic anthropological investigations (11). 

The Catoctin Furnace Historical Society, Inc. 
(CFHS), was initially founded to save the Ca- 
toctin Furnace village and its archaeological 
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and architectural heritage from this highway 
construction (4). In recent years, its mission 
has expanded to include restorative justice, 
highlighting the critical role that enslaved and 
free African Americans played in the furnace’s 
history and in the growth of industrial wealth 
and power in the young United States. In 
2015, a grant from the Maryland Heritage Areas 
Authority supported further scientific analysis 
of individuals buried in the cemetery. The first 
phase of the project involved historical docu- 
mentary research and osteological reanalysis 
that refined previous assessments of demog- 
raphy and bone and dental pathology, with 
testing for stable carbon and nitrogen isotopes 
and trace elements to shed light on the life 
histories of the individuals (12). The project’s 
second phase used DNA to explore their bio- 
geographic ancestries and relationships to one 
another, details of which were summarized in 
a technical note (73) and are expanded upon 
here. These data address a critical component 
of CFHS’s mission, pursued jointly with the 
African American Resources Cultural and Her- 
itage (AARCH) Society of Frederick County, to 
return knowledge to the African American 
community and identify descendants of Ca- 
toctin’s enslaved and free workers. 

Advances in ancient DNA (aDNA) technol- 
ogy have made it possible to use genetic data 
as a tool for restoring knowledge of enslaved 
and historically marginalized peoples whose 
stories were often omitted from or disre- 
garded in written records. Studies of the New 
York African Burial Ground (/4, 15), the Anson 
Street ancestors (J6, 17), and others (/8, 19) 
used a combination of anthropological and 
biomolecular tools to provide insight into the 
identity and life history of enslaved individu- 
als through the study of their remains. How- 
ever, the ability of those studies to localize the 
African origins of these individuals was lim- 
ited by the exclusive use of mitochondrial DNA 
(mtDNA) (20) and/or reliance on comparisons 
with data from publicly available reference 
datasets (21, 22). This study shows how deeper 
insights into the precise ancestral origins and 
genetic legacy of enslaved and free African 
Americans, such as those buried at Catoctin 
Furnace, can be obtained from genome-wide 
aDNA data when compared against a refer- 
ence database containing genetic data from 
millions of living people, such as the one 
maintained by 23andMe, Inc. 

We recovered genome-wide aDNA from all 27 
of the Catoctin individuals who were selected 
for sampling, targeting ~1.2 million single- 
nucleotide polymorphisms (SNPs) using a 
capture-based approach (23-26), which we 
combined with imputation to further increase 
the amount of available genetic information 
(table S1). By comparing the DNA of the Ca- 
toctin individuals with genotype data from 
9,255,493 participants in the 23andMe cohort, 
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all of whom consented to participate in re- 
search, we were able to learn about their bio- 
geographic ancestries and genetic relationships 
with one another and to provide insight into 
their genetic legacy by identifying identical-by- 
descent (IBD) connections with living relatives. 


Community engagement and ethical 
considerations 


This research analyzed data from deceased in- 
dividuals who were unable to directly consent 
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to participate in this study, as well as from 
millions of research participants (including 
those genotyped by 23andMe) who actively 
consented to participate in research. The ties 
between present-day African Americans, their 
ancestors within the US, and their ancestors in 
Africa were forcibly severed by the transatlan- 
tic slave trade, the centuries-long institution of 
slavery, and generational systemic racist prac- 
tices that have endured after the abolition 
of race-based slavery (27), as illustrated by 
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Frederick Douglass’s famous words: “Genea- 
logical trees do not flourish among slaves” 
(28). Our objective is to contribute to the res- 
toration of memories of a past community 
whose legacy was intentionally obscured and 
to create an avenue for living people to learn 
about their ancestors. We followed guidelines 
for the ethical analysis of the DNA of historical 
and living people (29, 30), including consulta- 
tion with stakeholder groups, as emphasized 
in recent discussions on the future of studies 
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involving the remains of African Americans 
(31-35). On the basis of interactions with stake- 
holders, we believe there is interest among 
African Americans and the public to harness 
aDNA to learn about historical connections to 
people who lived in the past and to leverage 
this technology to develop accurate methods to 
identify genetic relationships, many of which 
were previously unknown. Equally important 
is the need to communicate the results of 
these analyses with descendants and others 
in a sensitive and accurate manner. 

In the case of Catoctin Furnace, the goals of 
CFHS, developed in partnership with AARCH, 
include identifying descendants and widen- 
ing the community of stakeholders. To date, re- 
searchers at CFHS have traced the lineages of 
two African Americans (one enslaved and one 
free) who labored at Catoctin Furnace to their 
living descendants using historical documents 
and genealogical data. Members of CFHS, 
AARCH, and the recently discovered descend- 
ants all expressed support for the use of ge- 
netic approaches that could identify a larger 
descendant community. On the basis of con- 
versations with these stakeholders about 
their research interests, we sought not only to 
identify living genetic relatives of the Catoctin 
individuals but also to conduct analyses to 
shed light on their life stories, such as identi- 
fying family relationships shared between the 
Catoctin individuals, exploring their African 
and European origins, and identifying biolog- 
ically meaningful variants. In collaboration 
with CFHS and AARCH, we also held a series 
of public events to directly return research 
results at various phases of the project (36). 
One of the ways in which stakeholders chose 
to honor the legacy of enslaved individuals at 
Catoctin during these events was through the 
act of reading the names of individuals that 
could be abstracted from a variety of sources, 
including land records, probate inventories, 
church records, diaries, and freedom-seeker 
ads (although not associated with specific 
burials) (5). We include these names in the 
supplementary text of the supplementary 
materials (36). Establishing family connec- 
tions to living people through genetics con- 
tributes to remembering and honoring those 
buried at Catoctin. 

Here, we show that the joint analysis of DNA 
extracted from the remains of deceased humans 
and millions of living people in a recontactable 
research cohort (i.e., a cohort in which partici- 
pants can be asked follow-up questions or re- 
ceive results), such as the one maintained by 
23andMe, makes it possible to recover previ- 
ously unknown connections between present- 
day people and historical individuals from 
sites like Catoctin Furnace. Not all members of 
the Catoctin stakeholder community have a 
known genetic connection to Catoctin. Although 
this study is responsive to community requests 
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to use genetic approaches to identify descend- 
ants, future studies applying these methods 
should be cautious about the danger of con- 
tributing to the biologization of notions of iden- 
tity, as genetic connections represent just one 
of many ways in which people may feel con- 
nected to historical individuals. A full ethics 
statement is provided in (36). Additional ethical 
considerations involving the coanalysis of 
aDNA and data from private genetic databases 
are discussed in (37). 


Multiple families buried at Catoctin Furnace 


Among the 27 Catoctin individuals, we iden- 
tified five distinct genetic families (labeled A 
to E) that are primarily composed of mothers, 
children, and siblings; in this study, the term 
“genetic family” refers to a group of people 
who are closely related biologically. Similarly, 
specific relationship terms (e.g., mother, son, 
daughter) are used in a biological sense and 
are based on genetic sex as inferred by the 
presence or absence of sex chromosomes; 
thus, they may not reflect the actual kin-based 
relationships, biological presentation, or gen- 
der identities recognized by the Catoctin in- 
dividuals. Fifteen Catoctin individuals could 
be assigned to one of the five genetic families, 
whereas the remaining 12 individuals appear 
genetically unrelated, except for the individ- 
ual from burial 28, whose coverage was insuf- 
ficient to confidently assign to family B. Some 
unrelated individuals share mitochondrial (mt) 
or Y-chromosome haplogroups, which may in- 
dicate more distant relationships that fall 
outside the limits of our resolution. We used 
information about genetic sex and mt and Y 
haplogroups to further resolve these family 
groupings (Fig. 1B). Close genetic relatives 
tended to be buried near one another (Fig. 
1A), whereas individuals who were buried 
separately from their genetic families were 
typically more distantly related. For instance, 
family A consists of a mother (burial 3) and 
two sons (burials 1 and 2), interred side by 
side, in addition to a second- or third-degree 
relative (burial 24) who was buried separately 
and whose exact relationship to the other in- 
dividuals is unresolved. Although genetic re- 
latedness had a role in burial patterning, other 
factors, such as temporal context and cul- 
tural and religious practices, likely contrib- 
uted as well. 


Sex-biased reproduction 


The European ancestry of enslaved African 
Americans originated largely through a pro- 
cess whereby white men reproduced with 
Black women through rape. This gender-based 
sexual violence contributed to the brutal sys- 
tematic enslavement of African Americans and 
frequently produced children born into slavery 
(22, 38). This pattern of behavior, a form of sex- 
biased admixture, is reflected in the distribu- 
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tion of the mt and Y haplogroups observed 
among the Catoctin individuals. Three of the 
16 Catoctin males have Y haplogroups that 
are broadly associated with West Eurasian 
ancestry (fig. S1). These include subclades of 
the Rla and Rib haplogroups that are com- 
mon throughout Europe, indicating that their 
paternal lineages likely trace back to a fully 
European ancestor. In contrast, only one indi- 
vidual (burial 32) has a European-associated 
mt haplogroup (Jiblala) (fig. $2). This indi- 
vidual is an outlier with respect to ancestry, as 
they also have a European-associated Y hap- 
logroup (Rlalalbla3b) and >50% European 
ancestry. Among other possible causes, their 
spatially separated grave, located in the north- 
western edge of the burial ground, may reflect 
their distinct ancestral origins or lack of relat- 
edness within the community represented in 
the cemetery. 


Variable proportions of African, European, and 
Indigenous American-related ancestry in 
Catoctin individuals 


All individuals (except the individual from bur- 
ial 32) were assigned majority African ancestry 
by qpAdm, using a model designed to estimate 
each individual’s African, European, and Indi- 
genous American ancestry proportions (Fig. 
1C and tables $2 and S83). More than one- 
quarter of individuals (n = 7) could be modeled 
as having no detectable European admixture 
(i.e., the amount of ancestry assigned to the 
European source population is within a single 
standard error of zero). This is in sharp con- 
trast to nearly all present-day self-identifying 
African Americans, who typically have at least 
some European-derived ancestry [e.g., there 
is an average of 24% European ancestry among 
23andMe research participants who self-identify 
as African American (39)]. Several individu- 
als could be modeled as having Indigenous 
American ancestry, but in all cases, estimates 
were within three standard errors of zero. It is 
therefore uncertain, on the basis of these an- 
alyses alone, whether these estimates repre- 
sent true Indigenous American ancestry. 

We next imputed genotypes across the whole 
genome for each Catoctin individual using the 
software GLIMPSE (40), with an approach op- 
timized for low-coverage, capture-based aDNA 
data. To ensure that the imputation process 
would not bias results, we tested the perfor- 
mance of the templated positional Burrows- 
Wheeler transform (TPBWT) (41) IBD detection 
tool and 23andMe’s Ancestry Composition tool 
(42) on a dataset composed of 48 high-coverage 
ancient individuals who were down-sampled 
to varying degrees (36). Further, we compared 
the results of ADMIXTURE analysis (fig. S3), 
principal components analysis (fig. S4), and 
qpAdm analysis (fig. S5) using the imputed 
and nonimputed datasets to ensure that the 
imputation process did not substantially bias 
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ancestry estimates. We observed a bias toward 
estimating excess European-related ancestry 
in the lowest-coverage individuals when using 
the imputed dataset relative to the nonimputed 
dataset; however, this did not appear to sub- 
stantially affect individuals with >0.5x cover- 
age, which is the group we focused on for 
subsequent analyses. 

Using the imputed dataset, we applied 
23andMe’s Ancestry Composition tool (42) to 
infer local ancestry along the chromosomes 
of the Catoctin individuals. Each region of their 
genomes was assigned to one of six previously 
determined broad ancestry categories: sub- 
Saharan African, European, East Asian and 
Indigenous American, Western Asian and 
North African, Melanesian, and Central and 
South Indian (fig. S6 and table S4). These 
assignments are correlated with qpAdm es- 
timates (fig. S7) and provide support for the 
identification of Indigenous American ances- 
try in several Catoctin individuals. For instance, 
we inferred low levels of Indigenous American 
ancestry in two brothers (burials 1 and 2) from 
family A but not in their mother (burial 3) 


Family A 


Not | 


ae 


sets 


Uncertain Pedigree 


Ancestry Composition 


— East Asian & Indigenous American 


SS Melanesian 
(a Central & South Asian 


(Fig. 2), suggesting that their unsampled 
father had some Indigenous American ancestry. 


Identity by descent 


To learn more about the biogeographic an- 
cestry and genetic legacy of the Catoctin in- 
dividuals, we searched for IBD segments of the 
genome—that is, long segments of DNA that 
are identical in two or more people because 
they have been inherited from a recent com- 
mon ancestor. We searched for IBD segments 
shared between each of the Catoctin individ- 
uals and ~9.3 million 23andMe research par- 
ticipants. We identified 55,342 IBD segments 
shared between the historical Catoctin in- 
dividuals and 41,799 research participants, 
ranging up to 60 centimorgans (cM) in length 
(Table 1, figs. S8 and S9, and tables S5 and S6). 
We calculated the total IBD shared between 
each pair of individuals to estimate their most 
likely genetic relationship; however, we cau- 
tion that we are likely underestimating the 
true amount of DNA shared between these 
individuals, particularly among close relatives 
(36). In Box 1, we discuss how the relationships 
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between Catoctin individuals and research 
participants with whom they share DNA can 
be interpreted (table S7), noting that not all 
present-day individuals who share DNA with 
Catoctin individuals are direct descendants. In 
fact, most connections are likely between col- 
lateral relatives—relatives who are neither direct 
ancestors nor descendants of one another but 
instead both descend from a common ancestor 
who lived generations before the Catoctin indi- 
viduals. Further, many of the most distant rela- 
tives whom we identified may not share a 
common ancestor who lived in the Americas. 
Instead, their connection may trace back to an 
individual who lived in Africa or Europe before 
their descendants’ arrival in the Americas, either 
willingly or as part of the transatlantic slave trade. 


Genetic connections to present-day Africans 


We examined IBD segments shared between 
Catoctin individuals and members of the 
African cohort (i.e., 23andMe research partic- 
ipants with =95% sub-Saharan African ances- 
try who indicated that either they or all four of 
their grandparents were born in Africa) to 
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Fig. 2. Ancestry composition chromosome paintings for members of 
family A. Chromosome paintings demonstrating the biogeographic ancestry 
assigned to portions of the autosomal chromosomes for four related Catoctin 


individuals assigned to family A—a mother, two sons, and their second- or third- 


degree relative. A likely pedigree for family A (top left) describes their 
relationship to one another, although we note that the true relationship of the 
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individuals from burials 1, 2, and 3 to the individual from burial 24 is uncertain. 
Across the genome, ancestry is assigned to one of six ancestry components 
defined using 23andMe reference populations: sub-Saharan African (purple), 
West Asian and North African (dark blue), European (dark teal), East Asian and 
Indigenous American (dark orange), Melanesian (light orange), and Central 

and South Asian (green). Unassigned regions are shown in white. 
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Table 1. IBD shared between 22 Catoctin individuals and 23andMe 
participants. Summary statistics describing frequencies and amounts of 
IBD shared between the 22 Catoctin individuals with >0.5x coverage and 
subsets of the 23andMe cohort. Participants were included in the US cohort if 
either (i) all four of their grandparents were born in the US or (ii) they 
were born in the US and their grandparents’ birth location information was 
either unavailable or their grandparents were born in multiple countries. 
Similarly, members of the Atlantic African and European cohorts were 


Proportion of 
23andMe 


Number of 


determined using grandparent or participant birth location, with the 
additional requirement of =95% sub-Saharan African ancestry or =99% 
European, respectively. Results for each Catoctin individual are available in 
table S5. Values in columns marked with an asterisk are rounded according 
to the magnitude of IBD sharing as follows: values >100 cM are rounded 

to the nearest ten, values between 30 and 100 cM are rounded to the nearest 
five, values between 10 and 30 cM are rounded to the nearest integer, and 
values <10 cM are rounded to one decimal place. 


Median total IBD Maximum total 
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226,384 
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with =5% sub-Saharan 


0.22% 
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PERE a ae ee ee ee 
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identify the geographical regions in Africa with 
which these present-day people are associated. 
We observed the highest rates of IBD sharing 
between the Catoctin individuals and partic- 
ipants linked to Senegal, Gambia, Angola, and 
the Democratic Republic of the Congo (DRC) 
(Fig. 4A, fig. S10, and table S8), confirming via 
randomization testing that we would be un- 
likely to detect the same number of IBD con- 
nections (n = 10) with an identically sized 
sample (n = 166) of randomly selected indi- 
viduals from the Atlantic African cohort [i.e., 
research participants in the African cohort 
with ties to specific Atlantic African countries, 
defined in (36)] (P < 0.001) (table S9). 
Many African ethnolinguistic groups occupy 
wide geographic ranges that cross present-day 
national borders. We therefore also determined 
the amount of IBD each Catoctin individual 
shared with genetic clusters in Atlantic Africa 
that correspond to the self-identified ethno- 
linguistic groups of research participants (Fig. 
4B and table S10). Of the 15 Catoctin indi- 
viduals with detectable IBD connections to 
Africa, seven share a connection with only a 
single cluster. Six of these individuals have 
high proportions of sub-Saharan African an- 
cestry (>90%). In contrast, it is less common 
for present-day research participants with four 
grandparents born in the US and =50% sub- 
Saharan African ancestry to have a connec- 
tion to only a single cluster (36). Among the 
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Catoctin individuals with connections to At- 
lantic Africans, the most commonly observed 
connections are to genetic groups associated 
with the Wolof, Mandinka, and Kongo (whose 
present-day geographic distribution is described 
in table S11). Overall, the Catoctin individuals 
share relatively more connections with Sene- 
gambian groups, like Wolof and Mandinka, 
than do research participants with four grand- 
parents born in the US and =50% sub-Saharan 
African ancestry (Fig. 4 and supplementary 
text section S5). 


Genetic connections to present-day Europeans 


Next, we explored the rate of IBD sharing with 
members of the European cohort (ie., 23andMe 
research participants with >99% European 
ancestry who indicated that either they or all 
four of their grandparents were born in Europe) 
to learn about the origins of the European- 
related ancestry that we observed in more 
than half of the Catoctin individuals. We 
detect the highest rates of IBD among partic- 
ipants associated with Great Britain and 
Ireland (Fig. 5A and table S12). Randomization 
testing confirms that we would be unlikely 
to detect the same number of IBD connections 
(n = 467) with an identically sized sample (n = 
101,262) of randomly selected individuals from 
the European cohort (P < 0.001) (table S9). 
Much of the Irish-related signal is driven by 
connections to the individual from burial 15, an 
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adolescent male belonging to family C, who, 
when projected onto a graph network of clus- 
ters that correspond to European geography, 
again shares the most IBD with clusters of 
participants associated with the northern, west- 
ern, and southeastern regions of the island 
of Ireland (Fig. 5B). Multiple other Catoctin 
individuals (in particular, those from burials 
2, 10, and 34) share broader connections with 
participants from Great Britain and Ireland 
(Fig. 5B, fig. S11, and table S13). 


Distant and close relatives in the United States 


When we considered IBD sharing between re- 
search participants in the US cohort (i.e., re- 
search participants who indicated that either 
they or all four of their grandparents were 
born in the US) and each of the Catoctin in- 
dividuals separately, each historical individual 
exhibited a distinctive pattern of IBD sharing 
with respect to geography. However, when we 
considered the Catoctin individuals together, 
we observed the highest rates of sharing be- 
tween Catoctin individuals and research par- 
ticipants from the southern US (Fig. 6A, fig. 
$12, and table S14). This signal resembles the 
geographic distribution of 23andMe research 
participants with sub-Saharan African ances- 
try in the US (fig. S13) and is therefore plausibly 
driven by a higher rate of IBD sharing in 
genomic regions with sub-Saharan African 
ancestry. To address this source of bias, we 
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Fig. 3. Examples of chronological 
and genealogical distance 
between historical individuals 
and research participants. 

(A) A timeline showing the years 
in which the African American 
Cemetery at Catoctin Furnace 
was active and a histogram 

of birth years of research 
participants who share 

IBD with Catoctin individuals 
(table S23). (B) Examples 

of relationships that could be 
shared between individuals 

who were born five generations 
apart, with varying degrees 

of genetic separation. Median 
amount of IBD is reported for 
pairs of present-day and historical 
individuals (with 2x coverage) 
(table S7). 
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Fig. 4. Genetic connections to the Catoctin individuals among members 

of the African cohort. (A) The proportion of 23andMe participants in the 
African cohort who share IBD with Catoctin individuals. Geographic coordinates 
are rounded to the nearest integer, and only coordinates that have at least 

25 associated participants after 80% down-sampling are shown. Marker size 
indicates the number of participants associated with each coordinate, and color 
indicates the proportion of participants who share IBD with the Catoctin 


restricted our analysis to research participants 
in the US cohort with =5% sub-Saharan 
African ancestry (Fig. 6B and table S15). This 
filtering strategy increases the rate of IBD 
sharing from 0.45% of all US participants to 
4.25% (Table 1). 

For participants included in this filtered 
dataset, we continued to observe elevated rates 
of IBD sharing with Catoctin in the southern 
US (Gncluding Maryland) (Fig. 6B). We con- 
firmed via a randomization test that we would 
be unlikely to detect the same number of IBD 
connections (n = 2034) with an identically 
sized sample (n = 42,132) of randomly sam- 
pled individuals from the US cohort with 
>5% sub-Saharan African ancestry (P < 0.001) 
(table S9). In contrast, when we filtered to 
include only participants in the US cohort 
with =99% European ancestry to focus on 
the genetic legacy of the admixed Catoctin 
individuals’ European ancestors along line- 
ages with little to no African admixture, we 
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did not detect any clear geographic patterns 
(Fig. 6C and table S12). 

When we focused on “close relatives” (under- 
stood here as pairs of individuals who share 
at least 30 cM of IBD with a Catoctin indi- 
vidual, reflecting a relationship that is predicted 
to be ninth degree or closer), we observed 
particularly pronounced connections to Mary- 
land, identifying a total of 30 close relatives in 
the state (table S16). A randomization test 
confirmed that we would be unlikely to iden- 
tify 30 or more close relatives among an iden- 
tically sized sample (n = 19,972) of randomly 
selected individuals (P < 0.001) (table S17). 
These results suggest that at least some de- 
scendants of the Catoctin individuals or their 
family members remain in the Maryland area. 

Next, we considered population substructure 
among the close relatives of the Catoctin in- 
dividuals in the US using an IBD network 
approach. We analyzed 443 close relatives of 
the Catoctin individuals (230 cM of IBD) along 
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individuals. (B) IBD network demonstrating Catoctin individuals’ connections to 
members of the African cohort who share </00 cM of IBD with one another 
(N = 2807). IBD clusters (represented by circles) are filled according to 
members’ average local African ancestry and arranged by average pairwise 
IBD sharing between clusters using a Force Atlas graph layout. Catoctin 
individuals, displayed as squares, are projected on the basis of their average 
IBD shared with each cluster (shown as lines). 


with 4385 of those participants’ closest rela- 
tives who share =100 cM of IBD—many of 
whom we hypothesize could be genetically 
related to the Catoctin individuals, even if we 
did not detect IBD sharing between them 
owing to the high false-negative rate of our 
approach—and identified 123 familial groups 
(Fig. 6D and table S18). The positions of these 
familial clusters within the IBD network ap- 
pear to be primarily correlated with the rela- 
tive proportions of European and sub-Saharan 
African ancestry detected among each cluster’s 
members, likely reflecting the ancestry of the 
individual IBD segments shared with the Ca- 
toctin individuals. In most cases, the familial 
groups do not appear to be correlated with 
geography. Notably, however, all members of 
familial group 36 who provided location in- 
formation have ties to Maryland, providing 
further evidence that Catoctin descendants 
and close relatives remained in or returned to 
Maryland after emancipation. 
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Fig. 5. Genetic connections to the Catoctin individuals among members 

of the European cohort. (A) Proportion of 23andMe participants in the European 
cohort who share IBD with Catoctin individuals. Geographic coordinates are rounded 
to the nearest integer, and only coordinates that have at least 25 associated 
participants after 80% down-sampling are shown. Marker size corresponds to the 
number of participants associated with each location, while the color indicates 

the proportion of participants who share IBD with the Catoctin individuals. (B) IBD 
network indicating Catoctin individuals’ connections to the 23andMe participants in 
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the European cohort who share <700 cM with one another. Clusters are labeled by 
the geographic region with which the plurality of cluster members are associated 
using ISO2 country abbreviations and, when appropriate, prefixes to indicate the 
cardinal directions. Clusters are arranged by the average pairwise IBD sharing 
between clusters using a Force Atlas graph layout, with outlines that indicate 
participants’ average local European ancestry. Catoctin individuals, displayed as 
squares, are projected on the basis of the average IBD shared with each European 
cluster (shown as lines). 
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While elevated rates of IBD sharing are 
particularly evident in the Maryland region, 
we also identified other regions of the US 
with an enrichment of Catoctin close relatives. 
For example, the maximum amount of IBD 
(280 cM) is observed among a set of individ- 
uals from Southern California. This amount 
of sharing (corresponding to ~4% of the auto- 
somal genome) is consistent with a fifth-degree 
relationship to the individual from burial 35 
on the basis of maximum likelihood predic- 
tions (although the actual relationship may 
differ by a few degrees in either direction). 
Individuals with this amount of IBD sharing 
are likely either direct descendants of those 
buried at Catoctin or direct descendants of 
very close relatives of the Catoctin individuals 
(given that the Catoctin individuals likely lived 
at least five generations before most of the re- 
search participants included in this study) 
(Box 1 and table $7). 

To reconstruct pedigrees describing the con- 
nections shared between the Catoctin individ- 
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uals and their closest genetic relatives in the 
23andMe cohort, we used a modified version 
of the tool Bonsai (36, 43). The informed con- 
sent process for participation in 23andMe re- 
search requires strict protection of research 
participant anonymity, which means that full 
pedigrees cannot be published. Instead, we 
highlight the ways in which 8721 independent 
pedigrees (636 of which contain more than 
one research participant) are connected to Ca- 
toctin family A (Fig. 7 and table S19). We found 
no cases where the most likely connection was 
via a direct descendant of individuals from 
burials 1, 2, or 24, consistent with our expec- 
tations because these individuals died during 
childhood. The most probable path for most 
(83%) of the pedigrees we considered connects 
through an ancestor of the unsampled father 
of individuals from burials 1 and 2 (referred 
to as ungenotyped individual f in Fig. 7). We 
find that pedigrees that include research par- 
ticipants who have at least twice as much In- 


digenous American ancestry as sub-Saharan 
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African ancestry were significantly more likely 
to connect through this unsampled individual 
than through other Catoctin individuals (P < 
10° ; calculated via a permutation test with 
10° replicates) (supplementary text S6), con- 
sistent with our earlier prediction that this 
unsampled individual had some Indigenous 
American ancestry. 

Most inferred connections extend upward 
in pedigrees generated for families A, C, and 
D (meaning that they connect through an 
ancestor of one or more members of the family 
and therefore do not involve a direct-descent 
relationship) (Fig. 7, fig. $14, and table S19). 
This is consistent with expectations, as indi- 
viduals who lived in the past few hundred years 
are predicted to have far more living collateral 
relatives than direct descendants. Notably, 
lineages that extend upward from the Catoctin 
pedigrees tend to involve research participants 
who have more European ancestry than those 
lineages that extend downward. This sug- 
gests that a relatively large fraction of the 
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Fig. 6. Geographic distribution of distant and close relatives of the 
Catoctin individuals among members of the US cohort. (A) Proportion of 
23andMe research participants in the US cohort who share IBD with Catoctin 
individuals. Only coordinates representing at least 25 participants after 80% 
down-sampling are shown. Marker size corresponds to the number of 
participants associated with each coordinate, while color indicates the proportion 
of participants with shared IBD. Marker outlines indicate the number of 
participants at each coordinate who share 230 cM of IBD with one or more 
Catoctin individuals. The same information is shown for (B) participants with 


25% sub-Saharan African ancestry and (€) participants with =99% European 
ancestry. (D) IBD network of the closest present-day relatives of Catoctin 
individuals among 23andMe research participants. Circles represent modularity 
clusters consisting of close Catoctin relatives (sharing =30 cM of IBD) along with 
their relatives (sharing 2100 cM with a close relative of a Catoctin individual). 
Clusters are outlined according to their average ancestry and arranged by the 
average pairwise IBD sharing between clusters using a Force Atlas layout. 
Catoctin individuals, displayed as squares, are projected on the basis of the 
average IBD shared with each familial group (shown as lines). 


connections that we identified involve ances- 
tors of members of family A with European 
ancestry, which in part may reflect biases in 
the 23andMe cohort where European ances- 
try is overrepresented. 

These results demonstrate the power of our 
IBD-based approach to identify connections 
between historical and present-day individuals. 
In the future, by obtaining additional informed 
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consent from research participants, it may be 
possible to present more complex pedigrees 
that include direct descendants of historical 
individuals using this approach. 


Biologically meaningful variants 


We considered sites in the Catoctin individu- 
als’ genomes that might shed light on their 
physical traits and health (table S20 and fig. S15). 
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However, we caution that these results are 
based on low-coverage data and that further 
work is required to conclusively infer geno- 
types at these positions. 

For three Catoctin individuals, we identified 
copies of only the causal A allele at the genetic 
position 7s334/i3003137, which is associated 
with protection against malaria in the heter- 
ozygous form and sickle cell disease—a red 
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Fig. 7. Connections between Catoctin family A and modern pedigrees. The pedigree for Catoctin 

family A is shown with blue-shaded individuals connected by gray lines. Open gray diamonds indicate 
ungenotyped individuals, some of whom must have existed (individuals b, c, f, i, and j), whereas others 
(individuals a, d, e, g, h, and k) may have existed. The large triangle represents all present-day pedigrees 
composed of 23andMe research participants, and the probability distribution of how these pedigrees connect 
to the historical pedigree is inferred. Orange dots indicate all possible points of connection of present-day 
pedigrees to the historical pedigree. Numbers in ovals give the percentage of present-day pedigrees 
whose most likely connection was to a given point on the historical pedigree. Numbers in orange diamonds 
indicate the average degree of a lineage connecting to a particular point. Pie charts show the average 
European (blue), sub-Saharan African (purple), and Indigenous American (yellow) ancestry (normalized to 
sum to one) of individuals in pedigrees whose most likely point of connection was through the respective 


lineage leading to the present day. 


blood cell disorder that causes pain and in- 
creased susceptibility to infection and early 
mortality (44)—in individuals with two copies 
of the allele. Although we only have limited 
coverage at this position for each of these 
three individuals (range: 1 to 5x coverage), 
making it impossible to confidently determine 
their genotypes, we note that two individuals 
(burials 17 and 19) are siblings who died 
during early childhood. Sickle cell disease, 
if they had it, may have contributed to their 
early mortality. 

We also observed four individuals with at 
least one copy of the causal T allele at another 
genetic position (787050828) that provides pro- 
tection against malaria in genetically female 
individuals who carry one copy of this allele 
on only one of their X chromosomes. In ge- 
netic males, who are hemizygous, and in genet- 
ic females who are homozygous for the T 
allele at this position, it is associated with 
glucose-6-phosphate dehydrogenase (G6PD) 
deficiency (45). Both variants occur at ele- 
vated rates in populations with ancestry from 
sub-Saharan Africa, where malaria has histor- 
ically been endemic (46). Understanding how 
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the frequency of these alleles may have changed 
in African American populations over time is 
a question of great interest (47, 48), which may 
be informed by aDNA data as more historical 
African Americans are sequenced. 


Discussion 


This study demonstrates the power of genetic 
analyses to uncover previously unknown in- 
formation about the family structure and an- 
cestry of historical individuals and to connect 
them with living relatives. For Catoctin Furnace, 
this research is a critical step toward identify- 
ing a larger descendant community, one of the 
main goals expressed by stakeholders from 
CFHS, AARCH, and the two Catoctin descend- 
ant families recently identified using historical 
documents and genealogical data. Using the 
approach introduced here to identify genetic 
connections between historical individuals and 
their present-day relatives, researchers now 
have a powerful means to reconstruct these 
relationships through DNA analysis. 

At least 15 of the Catoctin individuals can be 
clustered into five genetic families, providing 
insight into the social structure of African 
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Americans at this early industrial site. Histor- 
ical records suggest that iron works enslavers 
often kept families together to benefit from a 
sustained knowledge transfer between gener- 
ations. It was also believed that this practice 
minimized the likelihood of revolt or escape 
caused by family separation (49-51). Most of 
the genetic connections we identified were 
between first-degree relatives, usually mothers 
and children. No fathers were identified, nor 
did we find families represented by three or 
more generations. There were at least 11 in- 
dividuals who appeared unrelated to others 
buried in the cemetery. These results may 
therefore indicate that, in practice, families 
did not remain together at this iron works. 
One possible explanation is that partners 
may have been sought outside of the Catoctin 
village. However, additional consideration must 
be given to how the types of genetic relation- 
ships we observed may be biased by incomplete 
excavation combined with burial patterning at 
the site, which was likely influenced by per- 
sonal choice and/or imposed religious prac- 
tices. For instance, from firsthand accounts 
recorded in journals and diaries, we know that 
some African American funerals at Catoctin 
were conducted by Moravian ministers (52, 53). 
The Moravians prescribed burial protocols in 
which families were not interred together 
but instead were buried in “choirs,” separated 
by marital status, age, and gender (54). This 
tradition introduces the possibility that married 
men were buried elsewhere in the cemetery and 
therefore were not sampled as part of this 
study (54). It was only through the coanalysis 
of aDNA and archival Moravian diaries that 
this distinctive aspect of burial and demogra- 
phy at Catoctin could be considered. 

The genetically inherited conditions identi- 
fied in this study (sickle cell anemia and G6PD 
deficiency) provide additional insight into the 
health and well-being of the Catoctin African 
Americans. Although these results are tenta- 
tive and require more refined analyses, they 
offer insight and potential future avenues by 
which to explore the remains of deceased hu- 
mans recovered from archaeological contexts. 

At the population level, the Catoctin indi- 
viduals have diverse ancestry, with clear genet- 
ic links to Africa, Europe, and the Americas. 
Most individuals have primarily sub-Saharan 
African ancestry, with the strongest ties to 
present-day peoples in Senegambia and West 
Central Africa, a region that primarily encom- 
passes present-day Angola and the DRC. These 
findings accord with historical records that 
show that slave ships originating from Sen- 
egambia (particularly before the end of the 
18th century) and West Central Africa ac- 
counted for the highest disembarkation rates 
in Maryland over the course of the transatlan- 
tic slave trade, whereas lower disembarkation 
rates were recorded from intervening coastal 


10 of 13 


RESEARCH | RESEARCH ARTICLE 


regions and the Caribbean (table S21) (55). 
However, records of the intra-American slave 
trade, which was responsible for the arrival 
of many enslaved African Americans to Mary- 
land, are more limited, making it challenging 
to infer what the most likely major sources of 
African ancestry in Maryland might have been 
on the basis of historical records alone. Ac- 
cording to the transatlantic slave trade data- 
base, ships departing from Senegambia and 
West Central Africa accounted for 15.8 and 
20.8%, respectively, of overall disembarkations 
in North America (55). Notably, the most com- 
mon sources of African ancestry among the 
Catoctin individuals do not align with larger 
trends in the greater Chesapeake region, where 
the highest rate of disembarkation was from 
the Bight of Biafra, home to people of Igbo and 
Yoruba ancestry (55). 

The Catoctin individuals were less likely to 
have genetic connections to multiple distinct 
African ethnolinguistic groups than research 
participants with substantial (250%) sub- 
Saharan African ancestry and long-standing 
ties to the US (ie., four grandparents born in 
the US), likely reflecting admixing between 
individuals with diverse African ancestries that 
occurred among enslaved African Americans 
and their descendants over time, after forced 
migration to the Americas (22, 56). 

Given the overrepresentation of research par- 
ticipants with European (particularly British 
and Irish) ancestry in the 23andMe cohort, we 
have even more power to localize the European 
ancestry of the Catoctin individuals. Eight indi- 
viduals exhibit connections to Great Britain and/ 
or Ireland. Differing frequencies of European- 
associated maternal and paternal haplogroups 
observed at Catoctin indicate that their Euro- 
pean ancestry was likely introduced through a 
sex-biased process almost certainly driven by 
rape of enslaved women as part of the gender- 
based sexual violence inherent in the US’s sys- 
tem of chattel slavery (38). We also observed a 
single European-associated mt haplogroup (in 
an individual who we estimate to have >50% 
European ancestry). This may be one of nu- 
merous examples present in the historical 
record of enslaved or free Black men having 
children with white women (often indentured 
servants) (57). 

Another objective of this study was to ex- 
plore the possibility of identifying direct de- 
scendants using DNA. We identified 41,799 
research participants with genetic connections 
to the Catoctin individuals. In many cases, 
it was possible to construct detailed genetic 
pedigrees that link 23andMe research par- 
ticipants to the historical individuals from 
Catoctin, either as direct descendants or, most 
commonly, as collateral relatives with a shared 
common ancestor. Although most of these 
connections are distant, we identified >500 
relatives who share =30 cM of IBD, reflecting 
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a maximum likelihood estimate of ninth-degree 
relationship or closer. 

It has been suggested that Catoctin’s enslaved 
workers were sold and transported to more- 
southern states when the furnace transitioned 
to white wage labor (4). During the early 19th 
century, large numbers of enslaved individuals 
were sold from mid-Atlantic states and trans- 
ported south (58). Although we observe strong 
connections to Maryland that suggest that this 
was unlikely to be the fate of Catoctin’s entire 
enslaved population, we do observe small 
clusters of close relatives throughout the US, 
including in the South. 

The methodological approach we present 
in this study can be applied to the remains of 
deceased humans from other sites and con- 
texts, offering a new scientific tool for indi- 
viduals and descendant communities seeking 
greater knowledge of their ancestors, as well 
as archaeologists, bioarchaeologists, histor- 
ians, and genealogists. Museums and univer- 
sities that steward the remains of deceased 
humans now have an additional means by 
which to identify these individuals and po- 
tentially link them to biological descendants. 
This work can currently only be done in part- 
nership with organizations with access to 
massive, genetically diverse, recontactable re- 
search cohorts, such as those maintained by 
genetic-ancestry companies. These partner- 
ships will require conversations on the ethical 
implications and consequences of this work, 
particularly how to avoid reinforcing the bio- 
logization of identity (37). For researchers, this 
study represents an advance in scientific meth- 
odology, but the impact may be even greater 
for those seeking connections to their past. 


Materials and methods summary 


We sampled aDNA from the temporal bones 
of 27 Catoctin individuals, using a minimally 
destructive cranial-based drilling approach 
when sampling from intact skulls (59). We 
generated double-stranded, partially uracil- 
DNA glycosylase-treated DNA libraries (60-64). 
Before sequencing, we enriched the libraries for 
DNA aligning to the mitochondrial genome and 
1.2 million positions in the nuclear genome 
using a capture-based approach (23-26). After 
bioinformatic processing, all 27 DNA samples 
were deemed suitable for analysis, although 
the data for two individuals were subjected to 
damage restriction because of slightly elevated 
mitochondrial (26) or X-chromosome (65) con- 
tamination rates. For each individual, we 
inferred genetic sex (66) and uniparental hap- 
logroups (67) and estimated the proportion of 
African, European, and Indigenous American- 
related ancestry by comparing against publicly 
available datasets (68-71). Diploid genotypes 
were imputed using GLIMPSE (72), with the 
1000 Genomes Project phase 3 dataset (68) 
as a reference panel. After filtering and re- 
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phasing the data (73), we searched for IBD (41) 
between the imputed Catoctin individuals and 
9,255,493 participants (elsewhere referred to 
as the 23andMe cohort) who had been geno- 
typed by 23andMe, Inc., a consumer personal 
genetics company, and provided informed con- 
sent to participate in research by 28 July 2020. 
Summary statistics were generated describing 
these IBD connections for cohorts of research 
participants that were created on the basis of 
each research participant’s genetic ancestry (as 
determined by the tool Ancestry Composition) 
and answers to 23andMe survey questions 
about their birth location and grandparents’ 
birth locations. Genetic pedigrees were recon- 
structed using a modified version of the Bonsai 
pedigree inference algorithm (43). Finally, 
we counted the number of different DNA se- 
quences that overlap biologically meaningful 
positions in the genome for each Catoctin 
individual. 
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Peptide YY: A Paneth cell antimicrobial peptide 
that maintains Candida gut commensalism 


Joseph F. Pierre*’, Brian M. Peters®, Diana La Torre’, Ashley M. Sidebottom’, Yun Tao’, 
Xiaorong Zhu’, Candace M. Cham’, Ling Wang?, Amal Kambal’, Katharine G. Harris‘, Julian F. Silva’, 
Olga Zaborina®, John C. Alverdy®, Herbert Herzog®, Jessica Witchley”*, Suzanne M. Noble’, 


Vanessa A. Leone’, Eugene B. Chang** 


The mammalian gut secretes a family of multifunctional peptides that affect appetite, intestinal 
secretions, and motility whereas others regulate the microbiota. We have found that peptide YY 
(PYY;-36), but not endocrine PYY3.3., acts as an antimicrobial peptide (AMP) expressed by gut epithelial 
paneth cells (PC). PC-PYY is packaged into secretory granules and is secreted into and retained by 
surface mucus, which optimizes PC-PYY activity. Although PC-PYY shows some antibacterial activity, 

it displays selective antifungal activity against virulent Candida albicans hyphae—but not the yeast form. 
PC-PYY is a cationic molecule that interacts with the anionic surfaces of fungal hyphae to cause 
membrane disruption and transcriptional reprogramming that selects for the yeast phenotype. Hence, 
PC-PYY is an antifungal AMP that contributes to the maintenance of gut fungal commensalism. 


he coevolution of metazoans and mi- 

crobes has been fundamental to the 

development of mutualistic beneficial 

relationships, not least in the digestive 

tract of animals. Gut microbes form 
region-specific stable and resilient commun- 
ities that are essential for processes such as 
immune and metabolic development and over- 
all intestinal homeostasis (J). Gut bacteria have 
been investigated in some detail, but less is 
known about gut fungi and archaea. The mi- 
crobiota appear to be regulated in a healthy 
host although little is known about host sur- 
veillance and control of fungal populations in 
the gut. 

Peptide YY (PYY) is a satiety hormone that 
is expressed and secreted by enteroendocrine 
cells (EECs) (2). In this work we found that 
gut-specific paneth cells (PCs) also express a 
form of PYY that functions as an antimicrobial 
peptide (AMP) with selective activity against 
Candida albicans. This fungus is found com- 
monly as acommensal yeast but becomes path- 
ogenic upon transformation into its hyphal 
form (3). The antimicrobial function of PC- 
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PYY plays a role in maintaining fungal com- 
mensalism by inhibiting the yeast-to-hypha 
transformation of opportunistic Candida spp. 


Presence of PYY observed in Paneth cells 


While visualizing mucosal enteroendocrine 
L-cells in murine distal ileum, we serendip- 
itously observed satiety-regulating peptide PYY 
immunostaining in lysozyme (LYZ1)-expressing 
PCs (Fig. 1, A and B). This finding was notable 
as PCs are gut mucosal epithelial cells found in 
most mammals, which secrete AMPs against 
pathogens and regulate the local gut micro- 
biota (4). We confirmed PYY immunolocaliza- 
tion in healthy human adult ileal PCs (fig. 
S1A), observed antibody epitope specificity 
through recombinant peptide quenching (fig. 
SI, B and C), and identified PYY mRNA in 
PCs through fluorescent in situ hybridization 
(FISH) (Fig. 1B). To achieve higher resolution, 
stimulated emission depletion (STED) and 
SP8 confocal microscopy were used to show 
that PYY and LYZ1 are packaged into discrete 
granules in the jejunum and ileum (Fig. 1, C 
and D, fig. S2, Aand B, and movie S1), raising 
the possibility that these peptides have differ- 
ent roles, sorting pathways, and regulation (5). 
Laser capture microdissection in the murine 
ileum followed by quantitative RT-PCR re- 
vealed robust PYY transcript levels in basal 
crypts in line with other PC-specific AMP 
transcripts (Fig. 1, E and F). Reanalysis of a 
publicly deposited murine small intestinal 
epithelial single-cell RNAseq dataset identi- 
fied PYY expression in mature EEC popula- 
tions that coexpressed with Scgi, Cck, and 
Glp1, but also in PCs in which it was coex- 
pressed with Dfal7, Atgi6l1, Defa5, and Lyz1 
(fig. S3, A and B) (6). The human protein atlas 
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PYY, showing detection in PCs (fig. 7 —— 


Antimicrobial functions of Paneth cell PYY 


PC expression of PYY indicated it might have 
an antimicrobial function. The predicted struc- 
ture of PYY resembles the alpha-helical, am- 
phipathic AMP, magainin-2 (Fig. 2, A and B), 
produced in the skin of Xenopus laevis. PC-PYY 
is strictly the unmodified, full length PYY;.35 
whereas the circulating endocrine form, PYY3 36, 
is formed by removal of two N-terminal amino 
acids by dipeptidyl peptidase IV (DPP-IV) (7). 
We initially assessed the antimicrobial activity 
of full length PYY peptide against represen- 
tative gram-positive and -negative bacteria, 
which showed less activity compared with more 
effective magainin-2 (Fig. 2, L to N). We then 
tested full length PC-PYY on C. albicans growth 
and viability (8, 9). We induced transition of 
yeast into hyphal forms in the presence of 
2.5% w/v porcine mucin. Propidium iodide (PI) 
was used to measure membrane permeabili- 
zation of PYY;3.6-treated C. albicans hyphae. 
Permeabilization of hyphae was observed at 
30 uM PYY,.36 but yeast was unaffected (Fig. 
2C and fig. S4A). PYY,.3¢-fluorescein isothio- 
cyanate (FITC)-tagged peptide colocalized with 
the PI signal within hyphae (Fig. 2C). We syn- 
thesized PYY using D-amino acids to generate 
a peptide stereoisomer, which disrupted hyphae 
in the same way as the native L-isoform (fig. 
S4B). Less pronounced permeabilization was 
observed with the shorter endocrine PYY3.36 
(fig. S4, B and C). Similar staining with PI was 
found with hyphal forms of C. tropicalis and 
C. dubliniensis treated with PYY1.36 (fig. S5). 

Given the observed hyphal specificity, we 
subjected C. albicans yeast and hyphae to in- 
creasing PYY;.35 concentrations in media with 
or without 2.5% w/v mucin. PYY;3¢ did not 
affect yeast numbers or density (Fig. 2, D and 
E); however, it elicited a decrease in hyphal 
respiration (Fig. 2F). Similar results were ob- 
tained for C. tropicalis (fig. S6). The enter- 
oendocrine form, PYY3.35, showed less activity 
(fig. S7A). Hyphal growth and survival were 
examined under increasing concentrations 
of PYY,36, where magainin-2 (positive) and 
scrambled (negative) peptides were used as 
controls. In aqueous conditions, hyphal growth 
and biofilm formation were significantly re- 
duced at 20 to 50 uM PYY}.36 (Fig. 2, G and H). 
We observed a biphasic dose-response of 
PYYj;.3g action in aqueous (RPMI) buffer, simi- 
lar to observations of other AMPs, with 
peak effect at ~30 uM and loss of activity by 
100 uM (10, 11). We attribute this observation to 
concentration-dependent self-aggregation, or 
multimerization, of amphipathic alpha-helical 
AMP precipitates in aqueous medium, reduc- 
ing bioactivity. Consistent with this hypothesis, 
a sigmoidal dose-response curve was observed 
in biofilm and colony-forming units (CFUs) when 
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Fig. 1. PYY localizes to ileal Paneth cells. (A) PYY (red) detection in in PCs extractions from ileal crypts and villus epithelial cells through laser-capture 
(red arrow) and L-cells (white arrow) through anti-PYY antibody IF. PC PYY microdissection (LCM). (F) Relative expression of PYY and marker mRNA for 
staining colocalized (right panel) with PC Lysozyme (LYZ, green; middle panel). crypt secretion compared with villus epithelium secretion estimated from 
(B) Confirmation of PC PYY localization through mRNA FISH (red, left panel) LCM. Lysozyme and Cryptdin-1 are AMPs that are also secreted from PCs; 
and counterstaining with anti-LYZ antibody (green, right panel). (C) High- Sucrase-isomaltase is released from the villus surface brush border; Neurotensin 
resolution stimulated emission depletion IF microscopy of PC PYY and LYZ is a marker for enteroendocrine cells. Significance was determined using a 
packaging in discrete secretory granules within the cytosol. (D) SP8 confocal t-test (n = 6, repeated twice, *P < 0.05; **P < 0.01; ***P < 0.001; 
microscopy of PYY (green) and LYZ (red) in ileum PCs. (E) Representative ****P < (0001). 
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Fig. 2. PYY displays antimicrobial activity toward Candida albicans hyphae and 
some bacteria. (A) Ribbon diagram (top) and space-filling model indicating electrostatic 
surface charge (bottom; red negative, blue positive) for amphipathic alpha-helix structures 
of PYY}.36 and Magainin-2. (B) Amphipathic PYY;.3. helical wheel projection displaying 
surface localization of residues and calculated hydrophobic dipole moment (uH 0.208, 
arrow). (C) Propidium iodide (PI) staining of C. albicans yeast and hyphae following 
exposure to PYY;.3¢-FITC (green) in standard antimicrobial peptide assay buffer (AMP), 
preferred growth media for each form (YPD-yeast or RPMI-hyphae), or standard buffer 
+2.5% w/v porcine mucus. C. albicans yeast (D) survival [colony-forming units (CFUs)] 
and (E) growth [Optical density (OD)] following exposure to PYY;.3. + mucus, PYY 
scramble, and Magainin-2 peptides. Pathogenic C. albicans hyphal (F) respiration by 
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tetrazolium salt (XTT) assay (G) dry weight (H) biofilm adherence and (I) CFUs following 
exposure to PYY;36 + mucus, PYY scramble, and Magainin-2 peptides. (J) Transmission 
electron microscopy for visualization of cationic surfaces (black probe accumulation) in 
C. albicans yeast and hyphae membranes. (K) Scanning electron microscopy of C. albicans 
yeast and hyphae following 2-hour exposure to vehicle (H20) or PYY;.3¢. Dose-dependent 
killing of gram-positive (L) and gram-negative (M) bacteria induced by PYY;.2¢. CFUs 
remaining were normalized to 0 uM PYY. Significance was determined through one way 
analysis of variance (ANOVA). (N) Impact of varying concentrations of PYY).3. + mucus 
versus Magainin-2 on bacteria growth. OD values were normalized to wells containing 

O uM PYY. All assays were performed in duplicate three times. Significance was 
determined through ANOVA (*P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001). 
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2.5% w/v mucin was present (Fig. 2, H and I). 
No differences were observed for biofilm with 
PYY3.36 (fig. S7B). We surmise that mucin pre- 
vents self-aggregation of PYY,36, which allows 
its activity to be maintained. 

To test whether the cationic properties of 
PYY;.36 (as shown in Fig. 2, A and B) allow 
binding to the anionic surface charge of fungi 
(5, 12) we used a cationic ferritin probe and 
transmission electron microscopy to identify 
sites of anionic surface charge (Fig. 2J, white 
arrows). The cationic probe decorated the sur- 
face of hyphae whereas the surfaces of yeast 
cells were not labeled. To further investigate 
charge interactions, we repeated PI membrane 
permeability assays with PYY,.35 in the pres- 
ence of sodium sulfate, to provide anionic 
quenching of cationic PYY;36 charge, which 
reduced PYY}.36-induced permeability (fig. 
S4D). Finally, scanning electron microscopy of 
yeast and hyphal forms of C. albicans showed 
that hyphal membranes exposed to PYY}.36 
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develop surface blebbing and irregularities 
whereas yeast membranes remain unaffected 
(Fig. 2K). 

RNA-seq was used to measure the transcrip- 
tional response to PYYj.35. The hyphal pheno- 
type showed down-regulation of cell wall 
synthesis, biofilm formation, and ribosome 
biogenesis pathways (fig. S8, A and B). By 
contrast, yeast cells showed fewer changes 
in gene expression and those mostly affected 
were genes involved in yeast-to-hypha tran- 
sition and adhesion. In all, our evidence indi- 
cates that PYY,35 has selective antimicrobial 
activity against the virulent, invasive hyphae 
of C. albicans, with limited effect on the 
blastoconidia. 


Paneth cell PYY release and mucus localization 


PCs secrete AMPs into the gut lumen in re- 
sponse to the presence of microbial products 
(5). Although PYY,35 is produced by L-cells 
where it is released systemically, the ubiqui- 


tous serine protease DPP-IV rapidly cleaves 
it into endocrine PYY3.35. We used ex vivo 
murine distal ileal loops to examine luminal 
PYY release (fig. S9, A and C). Following ex- 
posure to C. albicans, PYY1.35 was measured 
by LC-MS (fig. S10) in the lumen contents, 
the surface mucus, and mucosal tissue (Fig. 3A). 
At baseline, micromolar amounts of PYY}.36 
were detected in mucus following control in- 
cubations in this model (fig. S11). However, 
introduction of C. albicans hyphae into ileal 
loops significantly increased PYY}.36 levels. 
Less PYY;.36 was observed when yeast were 
present or when cell-free spent media were 
applied from cultures of either fungal mor- 
phology. PYY3.35 was not detected in any of 
the lumen contents, mucus, or tissue com- 
partments by MS, similar to previous findings 
(13, 14). These data indicate that PC-PYY is 
released into overlying mucus where it is re- 
tained specifically in response to the presence 
of C. albicans hyphae. We confirmed this 
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Fig 3. Exposure to C. albicans hyphae but not yeast enhances PYY localization 
into ileal mucus. (A) PYY;.3¢ quantification using liquid chromatography- 
electrospray ionization-mass spectrometry (LC-ESI-MS) in lumen (L), mucus 

(M), and tissue (T) of ex vivo ileal loops (n = 6 per treatment) and in (B) mucus 
of in vivo ileal loops (n = 5 per treatment) following stimulation with vehicle 
(Control), C. albicans hyphae (+Hyphae), C. albicans yeast (+Yeast), hyphae 
conditioned media (CM), supernatant (+HypS), or yeast CM supernatant (+YS). 
(C) Representative staining and quantification of PC PYY protein (red) following in 
vivo ileal loop stimulation with vehicle control, +Hyphae,.+Yeast, +HypS, or +YS. 
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Significance was measured using ANOVA and Dunnett's Multiple Comparisons Test; 
**P < 0.01, ***P < 0.005, ****P < 0.001. (D) PYY mucus versus aqueous 
partitioning in vitro. PYY;3¢ was added to either mucus (1) or aqueous (2) wells 
separated by a 1- kDa MWCO filter. Significance was measured using t-test; 

***P < 0.001. (E) Dipeptidyl peptidase IV (DDP-IV) IF in murine ileum brush 
border. Lectin UEA-I-FITC (Ulex europaeus) counterstain, ileal mucus; DAPI, nuclei. 
(F) PYY;.36 and PYY3-3¢ exposure to DPP-IV + mucus to assess peptide degradation 
through LC-ESI-MS (n = 3 per treatment), expressed as a percent of total 

peptide. All experiments were done in triplicate with 3 replicate experiments. 
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Fig. 4. PYY reduces gastrointestinal colonization of C. albicans in vitro and 

in vivo. (A and B) C. albicans hyphae abundance (Hyphae-GFP) on confluent Caco2 
cells exposed to vehicle (Control) or PYY}.36 (20 um) after 6 hours. (C) Fungal CFUs 
in WT C57BL/6 mice + exogenously administered PYY;.36 or scrambled peptide 
through oral gavage 8 days post C. albicans (4 x 10° CFU) challenge (n = 8 per group, 
*P < 0.05 versus control). (D) C. albicans colonization (4 x 10° CFUs) in PYY-KO 
relative to WT mice (n = 8 to 9 timepoints per group). (E) 72-hour PYY-KO and WT 
mouse survival after gavage with C. albicans (2 x 10’ CFU; n = 8 per group, repeated 
twice with both males and females) under cefoxitin and clindamycin (2 WT were 
humanely euthanized). (F) CFUs recovered from in small intestinal regions of PYY-KO 
and WT animals gavaged with C. albicans (2 x 10’ CFU). (G) Intestinal mucosal 
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Lumen Mucus Lumen Mucus 


gross morphology remained intact whereas (H) epithelial cell apoptosis (red, TUNEL; 
white asterisk) and (I) virulent Candida morphology (hyphae; white arrows) was 
increased versus yeast in the mucosa of PYY-KO versus WT animals (36.3 + 12.1 
versus 11.77 + 3.8%, P < 0.04) following oral gavage challenge under clindamycin 

(2 x 10’ CFU/mouse; n = 8 per group; repeated twice with both males and fernales: 
red indicates Candida). (J) Differentially altered fungal (top panel, n = 39 samples) 
and bacterial (bottom panel, n = 46 samples) populations from small intestinal mucus 
and lumen of PYY-KO versus WT animals without Candida challenge as determined 
through Wilcoxon rank test (between groups) and Kruskal-Wallis test (across groups) 
*P < 0.05, **P < 0.01, ***P < 0.001. To account for uneven sequencing depth, 

data were transformed into relative abundances based on total sum scaling. 
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response using in vivo ileal ligated loops (Fig. 
3B). Immunofluorescent studies confirmed in 
vivo depletion of PC PYY in the presence of 
hyphae (Fig. 3C and fig. $12). 

We used a 10-kDa MWCO filter to separate 
mucus and aqueous compartments to test 
whether the cationic amphipathic properties 
of PYY,36 have affinity to mucus (Fig. 3D). We 
found that PYY,3, preferentially partitions 
into mucus, likely through charge interactions, 
hence explaining its retention on the mucosal 
surface. Because DPP-IV is present in the 
mucosal brush border (15) (Fig. 3E), we ques- 
tioned why PYY3.35 was not detected in over- 
lying mucus. We incubated DPP-IV and PYY13.5 
with and without mucin, which showed PYY3.36 
generation in the absence of mucin but none 
when it was present, suggesting proteolytic pro- 
tection of PYY;36 within mucus layers (Fig. 3F). 


The effect of PYY on Candida at the 
intestinal mucosa 


Green fluorescent protein-tagged C. albicans 
hyphae (fig. S13) were cocultured with and 
without PYY,35 in human epithelial Caco-2 
cells. In the presence of PYY}.3¢, the numbers 
of hyphae attached to Caco-2 cells were sig- 
nificantly reduced (Fig. 4, A and B). 
Antibiotic-induced chronic C. albicans intes- 
tinal colonization was established through oral 
gavage (4 x 10° CFU) in specific pathogen-free 
wild-type (WT) or PYY gene-deficient (PYY-KO) 
mice and used to investigate the effect of PYY 
on fungal populations in vivo. Oral gavage of 
colonized WT mice with PYY,.3. resulted in 
decreased fungal titers in stool and intesti- 
nal contents as compared with mice given a 
scrambled peptide control (Fig. 4C and fig. 
S14). Conversely, fungal colonization of PYY- 
KO mice was 2 to 3 times higher than in WT 
counterparts (Fig. 4D). Total fecal output re- 
mained similar between groups, indicating that 
comparable gut motility was maintained. A 
second intestinal colonization challenge ex- 
periment was performed in PYY-KO and WT 
animals with a higher C. albicans inocula- 
tion (2 x 10’ CFU) under systemic and oral 
antibiotics to induce greater virulence. Here, 
PYY-KO animals exhibited elevated small in- 
testinal C. albicans counts, more pronounced 
mucosal apoptosis, and significantly greater 
hyphal versus yeast (36.3 + 12.1%) forms at 
the mucosal surface relative to WT counter- 
parts (11.77 + 3.8%, P < 0.04) (Fig. 4, E to D. 
Given the current evidence that full-length 
PC-PYY localizes to the mucus layer, we iso- 
lated lumen- and mucus-associated samples 
from PYY-KO and WT littermates for myco- 
biota characterization under standard colony 
conditions. Young animals were used as PYY 
KO animals develop insulinemia and weight 
gain by 14 weeks of age, which could itself con- 
found mycobiome results (2). Compared with 
the variable lumen fungal communities from 
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either WT or PYY-KO, the mucus-associated 
layer harbored a distinct fungal population en- 
riched for Candida (Fig. 4J and fig. S15, A to 
C). Compared to WT counterparts, PYY-KO ani- 
mals displayed elevated relative abundance of 
Candida and other Saccharomycetaceae, with- 
in the mucus (Fig. 4J, top panel, and fig. S15C). 
By contrast, bacteriome analysis showed minor 
relative abundance changes in Ruminococcus, 
Coprococcus, Lachnospiraceae, and Prevotella, 
as well as significantly altered beta diversity 
(Fig. 4, bottom panel, and fig. SI5B). No dif- 
ferences were observed in EEC or PC density 
between genotypes (fig. S16). 


Discussion 


Fungi are normally present in the “healthy” 
gut microbiome. The yeast Candida albicans 
is found in 70% of humans (6) but can tran- 
sition into an opportunistic pathogen (17). PYY 
is highly conserved in vertebrate species (18) 
and its only known function to date is endo- 
crine (19, 20). Here, we report that PC-PYY 
provides functionally specific AMP activity 
against the invasive hyphal morphotype of 
C. albicans with limited antibacterial activ- 
ity. Moreover, PC-PYY is packaged into secre- 
tory granules, distinct from those carrying 
Lyz1, which are released if mucosal surfaces 
are exposed to fungal hyphae but not if ex- 
posed to yeast. The specificity of cationic PC- 
PYY appears to lie in its electrostatic inter- 
action with the anionic surface charge of 
C. albicans hyphae, determined by cationic 
probe imaging of the hyphal and yeast sur- 
face, and charge quenching experiments with 
sulfate, which inhibited PC-PYY activity. Once 
secreted, PC-PYY is retained in the overlying 
mucus of the gut lumen. Finally, we show 
that PC-PYY drives transcriptional program- 
ming in C. albicans hyphae consistent with 
cell death and down-regulation of virulence, 
whereas commensal yeast respond by down- 
regulating pathways that promote the yeast-to- 
hypha transition. Thus, microbial selectivity, 
activation, and mucus compartmentalization of 
PC-PYY distinguishes it from other AMPs, 
which have broader ranges of activity against 
microorganisms (4, 21). 

Our studies show that PYY is active against 
virulent forms of C. albicans and less so on 
other commensal microbes. We observed 
similar membrane-disrupting effects of PC- 
PYY in other species of Candida including 
C. dubliniensis and C. tropicalis, but not the 
respiratory tract pathogen Aspergillus fumigatus, 
leading us to conclude that these observa- 
tions may be specific to intestinal fungal path- 
obionts. This property arguably plays a role in 
maintaining the gut mycobiome in a state of 
commensalism. This finding complements re- 
cent work showing anti-virulence properties 
of intestinal mucin glycans (22) and specific 
mucosal IgA responses to C. albicans hyphae, 
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which similarly inhibit virulence and maintain 
commensalism (23). 

This study has several important limitations. 
First, our in vivo model used a global PYY KO, 
which has disrupted endocrine PYY functions 
that could confound results compared with a 
PC specific PYY mutant. To limit this, young 
animals were used prior to developing the 
metabolic disruption caused by PYY knockout. 
Second, although PYY is found in several 
single cell RNAseq databases, not all report its 
presence, suggesting that expression is at low 
abundance or is regionally variable, consistent 
with our data. Finally, we employed in vitro 
PYY concentrations of 20 to 30 uM, whereas 
PYY was found to be 8 to 10 uM in intestinal 
mucus at baseline but becomes elevated upon 
C. albicans challenge. 
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Human STING is a proton channel 


Bingxu Liu’?>*+, Rebecca J. Carlson’*+, Ivan S. Pires*+, Matteo Gentili't, Ellie Feng”, 
Quentin Hellier’, Marc A. Schwartz*®7*, Paul C. Blainey'*°, Darrell J. Irvine**, Nir Hacohen??* 


Proton leakage from organelles is a common signal for noncanonical light chain 3B (LC3B) lipidation 
and inflammasome activation, processes induced upon stimulator of interferon genes (STING) activation. 
On the basis of structural analysis, we hypothesized that human STING is a proton channel. Indeed, 
we found that STING activation induced a pH increase in the Golgi and that STING reconstituted in 
liposomes enabled transmembrane proton transport. Compound 53 (C53), a STING agonist that binds 
the putative channel interface, blocked STING-induced proton flux in the Golgi and in liposomes. 
STING-induced LC3B lipidation and inflammasome activation were also inhibited by C53, suggesting that 
STING’s channel activity is critical for these two processes. Thus, STING’s interferon-induction 
function can be decoupled from its roles in LC3B lipidation and inflammasome activation. 


WURORS: SERA RAE 


timulator of interferon genes (STING) is 

a conserved mammalian cytoplasmic re- 

ceptor that is essential for sensing cyclic 

dinucleotides derived directly from bac- 

teria (2) or synthesized by cyclic GMP-AMP 
(cGAMP) synthase (cGAS) upon recognition of 
cytosolic DNA (2, 3). Upon binding to its native 
ligand, cGAMP, STING undergoes a confor- 
mational change and translocates from the 
endoplasmic reticulum (ER) to the Golgi and 
endosomes, where it carries out multiple biolog- 
ical functions, including interferon induction 
(4), noncanonical light-chain 3B (LC3B) lipida- 
tion (5), and NOD-like receptor family pyrin 
domain-containing 3 (NLRP3) inflammasome 
activation (6). Whereas interferon is induced by 
STING-mediated activation of TANK-binding 
kinase 1 (TBK1) and interferon regulatory factor 
3 (IRF3) (7, 8), the mechanisms by which STING 
activates noninterferon functions, in particu- 
lar noncanonical LC3B lipidation and inflamma- 
some activation, are still unclear. 

STING induces focal adhesion kinase 
family interacting protein of 200 kDa (FIP200)- 
independent noncanonical LC3B lipidation, 
which involves conjugation of autophagy-related 
protein 8 (ATG8) to single membranes (CASM) 
(9, 10). This process, sometimes termed “non- 
canonical autophagy” (17), is important for bac- 
terial control (72) and is known to be initiated by 
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ion release into the cytoplasm from acidic or- 
ganelles (such as Golgi and endosomes) through 
multiple mechanisms, including organelle mem- 
brane damage (12), pathogen-derived ion chan- 
nels such as the influenza matrix-2 (M2) protein 
(13), or proton ionophores (17). This led us to ask 
whether proton leakage from organelles is also 
involved in STING-induced LC3B lipidation and, 
if so, how STING activation leads to such ion 
transport. 


STING activation results in a pH increase 
in the Golgi 


To test whether STING activation leads to 
proton transport out of acidic compartments, 
we constructed genetically encoded ratiomet- 
ric pH sensors targeted to several organelles. 
As a sensor, we used superecliptic pHluorin 
(SEP), a variant of green fluorescent protein 
whose brightness increases with pH (J4, 15), 
fused to pH-insensitive mRuby3. This ratiometric 
sensor was targeted to the cis/medial Golgi 
[through fusion to alpha-1,3-mannosyl-glyco- 
protein 2-beta-N-acetylglucosaminyltransferase 
(MGAT)], the trans Golgi [through fusion to 
galactose-1-phosphate uridylyltransferase (GALT)], 
or endolysosomes [through fusion to lysosomal- 
associated membrane protein 1 (LAMP1)] (J6). 
These sensors were expressed in human BJ1 
fibroblasts, and SEP-to-mRuby3 fluorescence 
ratios were correlated with intracellular pH 
values by using calibration data (fig. S1, A and B). 
Upon treatment with both the positive-control 
vacuolar adenosine triphosphatase (V-ATPase) 
inhibitor bafilomycin Al (BafA1) and the STING 
agonist diamidobenzimidazole (diABZI), we 
observed that the ratio of SEP-to-mRuby3 fluo- 
rescence increased in both the cis/medial- and 
trans-Golgi compartments (Fig. 1, A and B, and 
fig. SIC). By contrast, in endolysosomal compart- 
ments, a pH increase was observed upon BafAl 
treatment but not upon diABZI treatment (fig. 
S1, D and E). However, SEP has a pK ~7.1 (15), 
and our own pH calibration data showed low 
sensitivity to changes in pH <6.5 (fig. S1, A and 
B), so it remains possible that STING activa- 


that is below the sensor’s limit of detectio.— 


No known transporters mediate STING- 
induced LC3B lipidation 


We next sought to systematically identify 
genes that mediate the pH increase observed in 
the Golgi compartment upon STING activation. 
Given that noncanonical LC3B lipidation is 
activated by proton leakage from acidic organ- 
elles, we reasoned that screening for genes 
that modulate STING-induced LC3B lipidation 
would also identify potential channel proteins 
responsible for the observed proton flux. We 
therefore carried out a genome-wide CRISPR 
fluorescence-activated cell sorting (FACS) 
screen, using human embryonic kidney 293T 
(HEK293T) cells transduced to express the 
autophagy-associated protein LC3B fused to 
red fluorescent protein (RFP) and hemagglu- 
tinin (HA)-tagged STING. To reduce the back- 
ground lipidated LC3B signal derived from 
basal canonical autophagy, we knocked out 
FIP200 (70). After transduction with the Brunello 
genome-wide lentiviral library (77), cells were 
stimulated with the STING agonist diABZI 
and permeabilized to remove LC3B that was 
not lipidated, further reducing background 
fluorescence (18). STING-HA* cells were sorted 
into LC3B™ and LC3B* bins (Fig. 1C) to spe- 
cifically identify STING-induced LC3B lipid- 
ation regulators that did not impair STING 
expression. The screen showed strong techni- 
cal reproducibility (fig. S1, F and G, and table S1) 
and identified critical STING-induced LC3B lip- 
idation regulators, including most of the known 
V-ATPase components, as well as noncanonical 
autophagy factors such as autophagy-related 
16-like 1 (ATGI6L1) (Fig. 1D). As a general 
mechanism of STING-induced LC3B lipidation, 
V-ATPase senses proton leakage from acidic 
vesicles through recruitment of V1 subunits to 
VO complexes that together act as a scaffold for 
recruitment of ATGI6L1, which initiates LC3B 
lipidation, through a process that is indepen- 
dent of V-ATPase’s proton-pumping function 
(9, 11, 12). Despite the high recovery rate of 
V-ATPase components and noncanonical au- 
tophagy factors, no other known channel protein 
perturbation significantly inhibited STING- 
dependent LC3B lipidation in our screen (Fig. 
1D). We thus hypothesized that STING itself 
may mediate the observed Golgi proton leak- 
age and thereby trigger V-ATPase assembly 
and subsequent recruitment of ATGI6L1 to 
initiate LC3B lipidation (12). 

To identify the domains of STING involved 
in LC3B lipidation, we first tested whether the 
STING ligand-binding domain (LBD), which 
has been proposed to recruit LC3B through its 
LC3-interacting region (LIR) motifs (5, 19), could 
induce LC3B lipidation upon translocation to 
Golgi or endosomes. We measured LC3B lipid- 
ation in 293T cells expressing wild-type (WT) 
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Fig. 1. STING activation leads to a pH increase in the Golgi, and a 


\V-ATPase, noncanonical autophagy components, and known ion transporters 


genome-wide screen for regulators of STING-induced LC3B lipidation did 
not identify transporters that could mediate this effect. (A) Representative 
images of BJ1 cells expressing a ratiometric SEP and mRuby3 reporter localized 
to MGAT or GALT at 0 and 60 min after 1 uM diABZI or 1 uM BafAl stimulation. 
Scale bar, 20 um. (B) Quantification of experiment in (A); data were combined 
from three independent biological replicates. The pH was predicted with the 
linear regression model in fig. SIB. The shaded region denotes SD. One-way 
analysis of variance (ANOVA) followed by Tukey's post hoc test at 60-min time 
point: ****P < 0.0001. (C) Workflow for the genome-wide CRISPR screen. 

(D) Volcano plot of genome-wide CRISPR screen results across two replicates; 


(GO:0015075, ion transmembrane transporter activity) are highlighted. NT 
indicates nontargeting control single-guide RNAs. FDR, false discovery rate. 
(E) STING-mNeonGreen (mNG) constructs and representative images of 
STING mNG localization in 293T cells expressing WT STING, STING AQQA, or 


TMEM192-STING-LBD and stimulated with dimethyl! sulfoxide (DMSO) or 1 uM 
diABZI for 1 hour. Scale bar, 10 um. One representative experiment of n = 2 
experiments. (F) Immunoblotting of phosphorylated STING (pSTING) and 


LC3B lipidation in 293T cells expressing WT STING, STING AQQA, or TMEM192- 
STING-LBD stimulated as in (E). One representative experiment of n = 3 
experiments. 


STING, a STING oligomerization-deficient vari- 
ant (A277Q/Q273A STING, termed “AQQA”) (20), 
or an endolysosome-localized STING fusion pro- 
tein (the endolysosomal protein TMEM192 
fused to the STING LBD) (27). After stimulation 
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with the STING agonist diABZI, the AQQA 
variant exhibited impaired translocation, phos- 
phorylation, and LC3B lipidation (Fig. 1, E 
and F). By contrast, TMEM192-STING-LBD 
did not induce LC3B lipidation despite its 


endolysosomal localization and strong induc- 
tion of STING phosphorylation (Fig. 1, E and F). 
Because translocation of the STING LBD domain 
was not sufficient to induce LC3B lipidation, 
we hypothesized that STING’s transmembrane 
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Fig. 2. A pore-binding small molecule inhibits pH increase in cells 
stimulated with STING agonist, and STING transports protons in an in 
vitro liposome assay. (A) Predicted pore for chicken cGAMP-bound STING but 
not the apo conformation; key parameters were calculated with MOLEonline. 
(B) Quantification of pH increase in BJ1 cells from 0 to 60 min after 1 uM diABZI 
or 1 uM BafAl stimulation with or without 10 uM C53; data from three biological 
replicates were combined. The shaded region denotes SD. One-way ANOVA 
followed by Tukey's post hoc at the end point measurements: ***P < 0.001; 
****P < 0.0001; n.s., not significant, P > 0.05. (€) Representative images of 
BJ1 cells in (B) and fig. S2b. Scale bar, 20 um. (D) Quantification of super- 
resolution Airyscan images of BJ1 MGAT SEP mRuby3 STING knockout cells 
overexpressing STING WT miRFP680 stimulated with 1 uM diABZI, representing 
four biological replicates and five individual cells. STING intensity was normalized 
to the per-cell baseline median intensity. Shaded region denotes SD. Two-tailed 


domain could play an important role in LC3B 
lipidation upon STING translocation. 


STING-mediated pH increase is inhibited by a 
small molecule binding a predicted pore in the 
transmembrane domain 


Given the necessity of STING translocation 
from the ER to the Golgi for STING-induced 
LC3B lipidation and the known role of a pH 
increase in acidic organelles as a common 
trigger for this process, we considered whether 
STING could generate Golgi ion leakage by 
inducing membrane damage, resulting in a 
secondary ion leakage, or by directly acting as 
an ion channel through its transmembrane do- 
main. STING translocation is known to induce 
LC3B lipidation without formation of galectin- 
3 puncta (9), suggesting that STING activation 
does not result in membrane damage. We thus 
investigated whether STING directly acts as a 
channel for proton release into the cytosol upon 
translocation to the Golgi, an acidic compart- 
ment (J6). STING-dependent induction of LC3 
lipidation is an ancestral function of the sensor 
conserved from Homo sapiens to Nematostella 
vectensis (5). Thus, if STING functions as a 
channel, this activity should be structurally 
conserved. 

To investigate whether STING could func- 
tion as an ion channel, we analyzed published 
cryo-electron microscopy (cryo-EM) structures 
of chicken STING (20) with MOLEonline, a tool 
for automated detection and characterization 
of channels in macromolecules (22). When we 
analyzed the cryo-EM structures of ligand-free 
STING [Protein Data Bank (PDB) structure: 
6NT6] compared with STING bound to its 
native ligand, cGAMP (PDB structure: 6NT7), 
the tool suggested a pore spanning the lipid bi- 
layer (1.3-A bottleneck radius, 29.9-A length) 
in ligand-bound STING that was absent in 
ligand-free apo STING; the latter showed a 
central cavity that did not span the whole 
membrane (Fig. 2A and fig. S2A). A recently 
discovered STING agonist, compound 53 (C53) 
(23), binds to the STING transmembrane do- 
main in the area of the putative pore. We hy- 


pothesized that C53 could be used as a tool 
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for inhibition of the proposed ion-channel func- 
tion of STING. Indeed, the STING-mediated 
Golgi pH increase observed upon treatment 
with agonists diABZI or cGAMP alone was sig- 
nificantly reduced when cells were cotreated 
with C53 along with diABZI or cGAMP (Fig. 2, 
B and C; fig. S2B; and movie S1), which is con- 
sistent with the idea that the proposed pore 
region of STING is required to produce a pH 
change within the Golgi. To exclude a potential 
role for downstream noncanonical autophagy 
factors in mediating the observed pH increase, 
we knocked out ATGI6L1, which is essential 
for STING-induced LC3B lipidation (0), in BJ1 
pH reporter cells. As expected, we found no 
significant inhibition of STING agonist-induced 
Golgi-pH increase in these cells (fig. $2, C and 
D). To further confirm STING’s mediation of the 
observed pH increase, we knocked out endoge- 
nous STING in BJ1 cells expressing the cis/ 
medial-Golgi pH reporter (fig. S2E) and over- 
expressed STING-miRFP680. We then stimu- 
lated these cells with diABZI and used live-cell 
super-resolution Airyscan imaging to image 
STING translocation to individual Golgi vesi- 
cles. STING preferentially translocated to cis/ 
medial-Golgi vesicles with a pH increase, indi- 
cated by higher SEP relative to the mRuby3 
signal (Fig. 2, D and E, and movie 82). Quanti- 
fication revealed an increase in STING signal 
over time in vesicles with a high SEP-to-mRuby3 
ratio, with little change in STING intensity at 
vesicles with a low SEP-to-mRuby3 ratio (Fig. 
2D), as well as an increase in the correlation 
between STING- and vesicle-SEP-to-mRuby3 
ratio over time (fig. S2F), as expected if STING 
translocation to acidic Golgi compartments 
induces proton leakage. 


STING reconstituted on liposomes 
transports protons 


To explore the sufficiency of STING for me- 
diating proton transport, we purified full-length 
human STING (fig. S2G) (20) and reconstituted 
STING into liposomes (24-26) (fig. S2H). We 
used the pH-sensitive dye 9-amino-6-chloro-2- 
methoxyacridine (ACMA) to measure proton 
flux into liposomes. ACMA will be sequestered 


Student's t test at the end point measurements: ****P < 0.0001. (E) Repre- 
sentative super-resolution Airsycan images of BJ1 cell from (D) at 0 min and 

30 min after 1 uM diABZI stimulation. Scale bar, 10 um; inset scale bar, 1 ym. 
(F) Schematic of the ACMA-based fluorescence flux assay. (G) ACMA-based 
fluorescence influx assay performed using preformed liposomes loaded with 
STING protein (protein:lipid at a 1:200 mass ratio) or matched detergent 
micelle containing buffer (Control). Loaded liposomes were treated with DMSO, 
100 uM C53, or 1 uM diABZI. One representative experiment of n = 4 
experiments carried out with two distinct batches of purified STING protein. 
Error bars indicate SD. Two-way ANOVA followed by Tukey's post hoc at the 
end point measurements: ****P < 0.0001; n.s., P > 0.05. For multiple 
comparisons, only “STING + diABZI" versus “STING + DMSO” and “Control + 
DMSO” versus “Control + C53” have n.s. P value; comparisons between other 
groups all have P < 0.0001. 


in liposomes and its fluorescence quenched 
upon pH changes induced by proton transport 
from the external buffer into the vesicles, enabl- 
ing quantification of proton flux on the basis of a 
reduction in total ACMA fluorescence (Fig. 2F). 
Proton flux was observed in STING proteo- 
liposomes and was reduced in the presence of 
C53, whereas control liposomes formed with 
identical solutions devoid of protein did not 
show proton flux (Fig. 2G). Thus, STING appears 
to be sufficient to transport protons across lipid 
membranes. To further control for potential 
effects of detergent, we removed detergent with 
Bio-Beads (Bio-Rad), which did not reduce pro- 
ton flux by STING proteoliposomes but did 
reduce proton leakage induced by addition of 
high (30 times as much as the standard amount) 
(fig. S21) detergent concentrations (fig. S2J). In 
contrast to the behavior in live cells, where 
STING-mediated proton leakage was induced 
by STING agonists such as diABZI or cGAMP, 
in liposomes, STING mediated proton leakage 
similarly in the presence or absence of diABZI 
(Fig. 2G). The dispensability of diABZI for pro- 
ton flux in this reductionist liposome assay 
suggests that a voltage difference or pH gradient 
(such as found in the Golgi) could induce an 
open conformation of STING and enable proton 
transport. Agonist binding in cells would thus 
appear mainly to be required for translocation 
of STING to this acidic organelle. By contrast, 
C53 directly reduces STING-driven proton trans- 
port in vitro. 


STING’s channel activity is required for its 
induction of LC3B lipidation 


Given the observed impairment of STING- 
mediated ion leakage upon treatment with C53 
both in cells and in vitro, we next asked whether 
C53 could inhibit other downstream functions 
of STING activation. We first tested whether 
STING-induced LC3B lipidation could also be 
inhibited by C53. Indeed, treatment with both 
cGAMP and noncyclic dinucleotide agonists 
MSA-2 or diABZI induced LC3B lipidation, 
whereas cotreatment with C53 strongly im- 
paired LC3B lipidation without associated in- 
hibition of STING phosphorylation or STING 
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Fig. 3. STING-induced LC3B lipidation is inhibited by C53, and STING 
S53L is less sensitive to C53-mediated inhibition of pH increase and 

LC3B lipidation. (A) Representative images of stably expressed RFP-LC3B and 
STING-HA in FIP200 KO 293T cells upon 1 uM diABZI stimulation for 1 hour with 
or without 10 uM C53 cotreatment. Scale bar, 20 wm. DAPI, 4',6-diamidino-2- 
phenylindole. (B) Quantification of experiment in (A), representing three 
biological replicates combined. Error bars indicate SD. One-way ANOVA followed 
by Tukey's HSD: ***P < 0.001; ****P < 0.0001; n.s., P > 0.05. (C) Immunoblots 
for indicated proteins in BJ1 cells with or without cotreatment with 10 uM 

C53 upon 20 ug/ml cGAMP (permeabilized with 5 ug/ml digitonin), 40 uM 


MSA-2, or 2 uM nigericin stimulation. One representative experiment of n = 3 
experiments. (D) Quantification of pH change from 0 to 60 min after 1 uM diABZI 
stimulation with or without 10 uM C53; data from three biological replicates were 
combined. STING was knocked out in BJ1 cells followed by overexpression of 
STING WT (left) or STING S53L (right). Shaded region denotes SD. One-way 
ANOVA followed by Tukey's post hoc at the end point measurements: **P < 0.01; 
****P < 0.0001; ns., P > 0.05. (E) Representative images of BJ1 cells assayed in 
(D). Scale bar, 20 um. (F). Immunoblots of indicated proteins in 293T cells 
expressing STING WT or STING S53L treated with 1 uM diABZI with or without 

10 uM C53 for 1 hour. One representative experiment of n = 3 experiments. 


translocation (Fig. 3, A to C, and fig. S3, A and B). 
C53 cotreatment did not greatly inhibit LC3B 
lipidation induced by nigericin, an ionophore 
that induces noncanonical LC3B lipidation 
independently of STING (27) (Fig. 3, B and C), 
suggesting that C53’s activity is specific to STING- 
dependent LC3B lipidation. To further exclude 
a STING-independent effect for C53, we knocked 
out endogenous STING in BJ1 cells expressing 
the cis/medial-Golgi pH reporter and over- 
expressed WT STING or STING S53L (fig. S3C), 
a STING variant with reduced binding to C53 
(23). We then measured pH changes upon stim- 


ulation with diABZI and observed that C53 
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cotreatment inhibited agonist-mediated pH 
increases in cells expressing WT STING but had 
no significant effect in cells expressing STING 
S53L (Fig. 3, D and E). Similarly, 293T cells 
stably transduced with STING S53L exhibited 
reduced sensitivity to C53-mediated impairment 
of LC3B lipidation induced by diABZI treat- 
ment relative to cells expressing WT STING 
(Fig. 3F). 


STING’s channel activity is required for its 
activation of the NLRP3 inflammasome 


In addition to induction of interferon and LC3B 
lipidation, STING activates the NLRP3 inflam- 


masome in human myeloid cells (6), but the 
mechanism remains unclear. The influenza virus 
M2 pore protein induces noncanonical LC3B 
lipidation by inducing proton leakage (13), while 
also activating the NLRP3 inflammasome (28). 
Perhaps then STING might activate the inflam- 
masome in a similar manner, and C53 could 
block this activity. Upon activation, NLRP3 
translocates from the cytosol to Golgi vesicles, 
where it initiates downstream inflammasome 
activation (29). Using an NLRP3-mNeonGreen 
reporter, we found that NLRP3 formed puncta 
upon stimulation with the STING agonist 
diABZI (Fig. 4A), similarly to when cells were 
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Fig. 4. STING-induced NLRP3 inflammasome activation and IL-1 release 
Representative images of pSTING, STING, and NLRP3 
in HEK293T cells expressing STING-HA and NLRP3-mNeongreen (NLRP3-mNg) 
treated with DMSO and 1 uM diABZI with or without 10 uM C53 for 1 hour. Scale 
bar, 20 um. (B) NLRP3 translocation quantified as the per-cell maximum NLRP3 
in (A) from three biological replicates combined. 
One-way ANOVA followed by Tukey's post hoc: ****P < 0.0001; n.s., P > 0.05. 
(C) Same as (B) but quantifying pSTING intensity. (D) Experimental workflow for 
primary human monocytes. (E) Immunoblots of 
processed IL-18 from human monocytes (primed with R848) upon no stimulus 


are inhibited by C53. (A) 


intensity from experiment 


inflammasome induction in 


stimulated with the NLRP3 agonist nigericin 
(fig. S4A) (29). Furthermore, NLRP3 colocal- 
ized with STING and phosphorylated STING 
(pSTING) on these puncta (Fig. 4A). Consis- 
tent with the hypothesis that STING-induced 
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proton leakage is the driver of downstream 
NLRP3 activation, when we treated cells with 
both diABZI and C53, we observed a signifi- 
cant reduction in NLRP3 translocation together 
with an enhancement in STING phosphoryl- 


(NS), 10 pg/ml cGAMP, 1 uM diABZI, or 6.7 w 
absence or presence of 10 uM C53 or 5 uw 
One representative donor of n = 3 donors 
measurement from stimulated human monocytes [(left) Pam3CSK4 primed, 
(right) R848 primed] of processed IL-18 upon 
1 uM diABZI, or 6.7 uM nigericin stimulation in the absence 
10 uM C53 or the 5 uM NLRP3 inhibitor 
one donor with total n = 4 donors. Error bars indicate SD. Two-way ANOVA 
followed by Dunnett's multiple comparisons test: n.s., P > 0.05; *P < 0.05; 
***P < 0.001; ****P < 0.0001. 


LEE LEE LES 


RS RS 


R848 primed monocytes 


nigericin stimulation in the 
NLRP3 inhibitor MCC950 (MCC). 

tested. (F) Supernatant cytokine 

S, 10 ng/ml cGAMP, 

or presence of 

CC950; each data point represents 


ation (Fig. 4, B and C). We also tested whether 
STING-induced LC3B lipidation could have a 
role in STING-induced inflammasome activa- 
tion by knocking out ATGI6L1 in BLaER1 cells 
(fig. S4B), a human cell line that can be 
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transdifferentiated to monocytes and in which 
STING activation leads to NLRP3-dependent 
interleukin-18 (IL-18) release (6). Knockout of 
ATGI6L1 did not impair diABZI-induced IL- 
1B release (fig. S4C). Thus, STING-induced in- 
flammasome activation is independent from 
STING-induced LC3B lipidation. Lastly, we 
tested whether C53 could block STING-induced 
inflammasome activation as measured by 
IL-1f release and cell death in primary CD14* 
monocytes (Fig. 4D). In agreement with our 
findings in HEK293T cells, C53 cotreatment 
significantly impaired STING-induced inflam- 
masome activation, inhibiting IL-1f release 
(Fig. 4, E and F) and cell death (fig. S4D) in 
primary human monocytes stimulated with 
cGAMP or diABZI to a level similar to that of 
monocytes treated with the NLRP3 inhibitor 
MCC950. C53 did not affect IL-1B release (Fig. 
4, Eand F) or cell death (fig. S4, C and D) when 
the NLRP3 inflammasome was activated by 
nigericin, further indicating that C53 impairs 
activation of the NLRP3 inflammasome in a 
STING-specific manner. Thus, similarly to the 
influenza protein M2, STING activates the 
NLRP3 inflammasome through induction of 
a proton leakage. 


Discussion 

Here we demonstrated that STING activation 
induces proton leakage at the Golgi through a 
channel formed at the interface of the STING 
homodimer’s transmembrane domains. This 
STING-mediated pH increase is inhibited by 
the small molecule C53, whereas no inhibition 
of pH increase was observed in cells express- 
ing STING S53L, a STING variant with reduced 
binding to C53 (23). We also showed that STING 
transports protons in an in vitro liposome 
assay and that C53 treatment inhibited STING 
proton transport in vitro. 

In addition to demonstrating proton trans- 
port through purified STING in vitro and STING- 
dependent pH changes in cells, we also found 
that STING’s channel activity is critical for 
downstream activation of LC3B lipidation and 
of the NLRP3 inflammasome because treat- 
ment with C53 impaired these activities without 
reducing STING phosphorylation. C53 treat- 
ment also inhibited NLRP3 inflammasome 
activation downstream of STING in primary 
human monocytes. These findings provide an 
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avenue for decoupling STING phosphoryla- 
tion from induction of LC3B lipidation and 
inflammasome activation induced by STING. 
Comparing the effects of agonists that bind 
STING’s natural binding pocket or a pore- 
associated pocket, such as C53, could help 
determine the relative importance of STING 
phosphorylation versus channel-mediated down- 
stream functions in diverse biological con- 
texts and therapeutic applications. 
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Tumor microenvironments (TMEs) influence cancer progression but are complex and often differ between 
patients. Considering that microenvironment variations may reveal rules governing intratumoral cellular 
programs and disease outcome, we focused on tumor-to-tumor variation to examine 52 head and neck 
squamous cell carcinomas. We found that macrophage polarity—defined by CXCL9 and SPP1 (CS) 
expression but not by conventional M1 and M2 markers—had a noticeably strong prognostic association. 

CS macrophage polarity also identified a highly coordinated network of either pro- or antitumor variables, which 
involved each tumor-associated cell type and was spatially organized. We extended these findings to other 
cancer indications. Overall, these results suggest that, despite their complexity, TMEs coordinate coherent 
responses that control human cancers and for which CS macrophage polarity is a relevant yet simple variable. 


umor microenvironments (TMEs) are 

ecosystems composed of diverse stromal, 

immune, and cancer cells. The interac- 

tions between these components are in- 

completely understood, but some of them 
produce pro- or antitumor effects (J). It is there- 
fore essential to decipher the components of 
these ecosystems and better understand some 
of their internal governance and rules of in- 
teraction. The results could lead to an improved 
understanding of the regulatory mechanisms 
that determine cancer progression as well as 
resistance or response to treatment and, by 
extension, better define patient prognosis and 
treatment options. To date, many studies have 
described TMEs using low-resolution tech- 
niques, such as bulk RNA sequencing (RNA-seq) 
and immunohistochemistry. These data have 
allowed classification of TMEs on the basis of 
broad criteria, such as the composition of the 
immune infiltrate and the character of the in- 
flammatory response (2). For example, the abun- 
dance or phenotype of various immune cells 
within tumors, including T cells (3-5), B cells 
(6-8), dendritic cells (DCs) (9-11), and macro- 
phages (12-14), may be associated with distinct 


clinical prognoses. More recently, single-cell 
RNA sequencing (scRNA-seq) has been in- 
creasingly used to define the TME because it 
directly reveals its cellular heterogeneity with- 
out the need for predefined sets of markers 
for cell identification (15-20). Nevertheless, 
scRNA-seq studies typically consider cells from 
all patients as units of statistical replication, 
which prevents formal testing of tumor-to- 
tumor variation between patients. In this work, 
we designed a population-oriented scRNA-seq 
approach that considers tumors, not cells, as 
units of statistical replication. This approach 
allowed us to discover features that covary be- 
tween individuals and thus to better under- 
stand the complexity of TMEs and the rules 
governing their composition and relationship 
to disease outcome. We also performed his- 
tological analyses to validate the scRNA-seq- 
based results and obtained spatial information 
on the variables of interest within the TME. 


Tumor-associated macrophages are major 
contributors to the clinical outcome of HNSCCs 


We generated a scRNA-seq dataset from 52 
fresh head and neck squamous cell carcinoma 


range of clinical characteristics, spanning the 
clinical spectrum from primary tumors to loco- 
regional recurrences to distant metastases, 
and without selection or enrichment for a par- 
ticular cell type before sequencing (fig. S1 and 
table S1). We defined cell clusters from a total 
of 187,399 cell transcriptomes using Seurat’s 
FindCluster and manually annotating them at 
different resolutions from main cell compart- 
ments to minor cell states (fig. S2). This data- 
set comprehensively covered all three main 
cell compartments, namely tumor, stromal, and 
immune cells, as well as all major cell types, 
namely tumor and epithelial cells, fibroblasts, 
endothelial cells, lymphocytes, and myeloid 
cells (Fig. 1A and figs. S3 and S4). These cell 
types were reproducibly found in patients (fig. 
S5A), but their abundances varied greatly (fig. 
S5B). Epithelial cells, fibroblasts, endothelial 
cells, lymphocytes, and myeloid cell states con- 
tained a finite number of minor cell states, 
which were also reproducibly found in pa- 
tients; this was in stark contrast to the tumor 
cell states from the same patients, which were 
largely patient specific (fig. S5C). 

We identified 1189 genes with expression in 
the MGH/MEE-HNSCC cohort that was dom- 
inant (i.e., at least threefold higher) in a given 
cellular compartment relative to all others 
(Fig. 1B, left, and table S2). We then used this 
gene list to interpret a collection of bulk mRNA- 
seq data from 886 HNSCC patients obtained 
by pooling three independent cohorts [The 
Cancer Genome Atlas (TCGA), LPZG, and FHUT] 
(table S3). The expression heatmap of these 
genes in all 886 patients (Fig. 1B, middle, and 
fig. S6A) showed mixing of the cohorts, indicat- 
ing intercohort homogeneity. It also showed 
an association between human papilloma virus 
(HPV) status and molecular tumor subtype, as 
expected (21). Finally, pairwise correlation 
analysis showed that the TME compartment- 
dominant genes were coenriched in their re- 
spective cellular compartments (Fig. 1B, right), 
indicating that what becomes apparent as co- 
regulation in bulk mRNA data may in fact 
largely be driven by heterogeneity in abun- 
dance of various TME cell types. Notably, tu- 
mor cell-gene intrinsic clustering was still 
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associated with differential abundance of stro- 
mal and immune cell transcriptomes (fig. S6B), 
which suggests the existence of an orchestrated 
cellular communication between different TME 
compartments. 

Using the information obtained from the 
analyses above, we next investigated the influ- 
ence of the three main cellular compartments 
(tumor, stroma, and immune) on clinical out- 
come. We applied supervised methods that are 
commonly used for bulk mRNA-seq analysis 
but considered only the cell type-dominant 
genes as the feature variables. To ensure in- 
ternal normalization of gene expression in 
each cell type, we used a pairwise log ratio 
method (22) followed by construction of multi- 
pair signatures using an elastic network clas- 
sifier (23) that was trained on overall patient 
survival. Univariate analysis showed that each 
major cellular compartment had a signifi- 
cant prognostic impact (fig. S7A). However, a 
multivariate analysis in which all three com- 
partments compete to explain the patient’s 
clinical course showed that only the stromal 
(P = 0.01) and immune (P = 0.0003) signa- 
tures retained their prognostic ability (Fig. 1C), 
which indicates a more-direct association be- 
tween these nontumor components with dis- 
ease progression. 

Because immune cells include different sub- 
sets with distinct functions, we sought to deter- 
mine which cells carried the most prognostic 
information. In univariate analyses, every ma- 
jor immune cell type except monocytes had a 
significant prognostic impact (fig. S7B); how- 
ever, in multivariate analyses, only mast cell (P = 
0.04) and macrophage (P = 0.01) signatures 
retained independent effects (Fig. 1D). This 
showed that prognostic information from the 
different cell types was mostly redundant and 
suggested close coordination between these 
cells. We also illustrated the relevance of the 
macrophage-specific signature in Kaplan-Meier 
survival analyses (Fig. 1E). 

Among the prognostic gene pairs identified 
in macrophages, we selected CXCL9 and SPPI 
because the expression ratio of these two genes 
(CS) sufficiently captured the information in 
the entire signature (fig. S8A), and these two 
genes already have established and arguably 
opposing roles in cancer biology (24-26). The 
association of the other gene pairs with sur- 
vival is shown in fig. S8A. CS tumor-associated 
macrophage (TAM) polarity was associated 
with overall survival when analyzed separately 
in the three independent HNSCC cohorts (fig. 
S8, B and C); it also remained linked to disease 
outcome independently of CD274 expression 
(fig. S8D), which is frequently considered pre- 
dictive of a patient’s clinical course (27). Further- 
more, CS TAM polarity remained independent 
of tumor HPV status (fig. S9, A to C) and disease 
stage (fig. S9, D and E). Taken together, these 
data indicate that the TAM gene expression 
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programs were particularly important in de- 
fining disease progression. 


CXCL9 and SPP1 identify TAM polarity in vivo 


Quantitative analyses of scRNA-seq (Fig. IF) 
and histological (Fig. 1G) data both showed 
that expression of CXCL9 and SPP1 by macro- 
phages was largely mutually exclusive at the 
single-cell level. Also, by inferring the abun- 
dance of TAMs and their CS polarity in bulk 
RNA-seq data (fig. S10), we found that CS po- 
larity was prognostic, whereas TAM abundance 
was not (Fig. 1H). Patients with more PPI" 
TAMs had poorer clinical outcomes, whereas 
patients with more CXCL9”™ TAMs had better 
ones, and the CS ratio correlated strongly with 
our initial macrophage prognostic score; by 
contrast, TAM expression of M1 (/TGAX and 
CD80) or M2 (CD163 and MRCI) macrophage 
markers, which are widely used to define macro- 
phages with anti- and protumor functions, re- 
spectively, were not related to distinct clinical 
outcome in the same patients (Fig. 11). These 
results supported the notion that M1 and M2 
phenotypes define the possible extremes of 
polarized cells in vitro but may not necessarily 
reflect the complexity of TAMs in vivo (28). 
Finally, CS TAM polarity was low pretreat- 
ment in nine out of nine patients who did not 
respond to anti-PD1-containing treatment regi- 
mens but high in three out of five patients who 
responded to treatment (Fig. 1J). This is addi- 
tional evidence that CS TAM polarity alone 
may carry meaningful clinical information. 

Next, we evaluated whether CXCL9 and/or 
SPPI expression was restricted to discrete, minor 
monocyte, or TAM states. To this end, we first 
used unsupervised clustering of sCRNA-seq data 
to reveal the complexity of monocytes and macro- 
phages in HNSCCs. We found stereotyped gene 
expression programs that defined a total of four 
conserved monocyte states and seven conserved 
TAM states (fig. S11A), which were reproducibly 
found in patients (fig. SIIB). Each state was 
named after the gene that it expressed domi- 
nantly over all other monocyte and TAM states 
(fig. SIIC and table S4). Monocytes mostly ex- 
pressed low levels of CXCL9 and SPP1. Among 
TAMs, Mac_APOE, Mac_CXCL10, and Mac_Fi3A1 
preferentially expressed CXCL9, and Mac_SPPI 
and Mac_MTIH preferentially expressed SPP1 
(fig. S1ID); however, all states of TAMs expressed 
both transcripts at variable levels (these findings 
are further discussed below and are presented 
in Fig. 2F and figs. S19 and S20). 

We also examined whether any of the ob- 
served monocyte or TAM states corresponded 
to M1 or M2 cells (fig. SIIE). To this end, we 
used MO, M1, and M2 signatures of in vitro- 
polarized cells and involving a total of 596 genes 
identified from public dataset GSE158094 (29). 
We found that SPP? and CXCL9”™ TAMs all 
expressed elevated MO, M1, and M2 signatures 
compared with monocytes and Mac_MARK4. 


The only substantial differences were higher M2 
versus M1 signatures in Mac_SPPI and higher 
MI versus M2 signatures in Mac_CXCLI0. These 
results align with previous findings suggesting 
that the phenotypes of human TAMs only par- 
tially overlap with those of M1 and M2 mark- 
ers (18, 28, 30, 31). Therefore, the study of 
CXCL9 and SPP1 expression by TAMs should 
be more relevant to capture the polarity of these 
cells in vivo and their association with other 
TME components and disease progression. 


CXCL9 and/or SPP1 TAM polarity defines the 
broader HNSCC microenvironment 


For subsequent analyses, we used the CS ratio 
to assess TAM polarity in vivo (Fig. 2A). Further- 
more, we examined tumor-to-tumor variation 
of TAM polarity, considering that this varia- 
tion may reveal rules governing the cellular 
and molecular composition of the TME. Spe- 
cifically, we classified the 52 samples according 
to CS TAM polarity and performed correlation 
analyses between this variable and other TME 
variables, including the abundance of different 
cell types and their gene expression profiles. 
For the latter, the main unit of analysis was the 
expression of a gene averaged over all cells 
from the same sample and cell state (fig. S12). 
The resulting value is independent of cell abun- 
dance, unlike the so-called pseudobulk sums 
that are typically used (32). In doing so, we 
were able to detect co-regulation of gene ex- 
pression between cell types, which indicates 
the existence of a coordinated communication 
network in the TME, as detailed below. 

CS TAM polarity was positively associated 
with increased tumor infiltration by three ma- 
jor immune cell types, namely T cells, B cells, 
and DCs (Fig. 2B and fig. S13). Operationally 
dividing T cells into well-defined transcrip- 
tional states (33) revealed that all of them con- 
tributed to the association between CS TAM 
polarity and overall T cell abundance (fig. S14, 
A and B). Histological analyses on tissue sec- 
tions (n = 23) derived from the same tumors 
that were analyzed by scRNA-seq confirmed 
this finding for various CD4* and CD8* T cell 
populations (Fig. 2C and fig. S15). Similarly, 
both plasma and nonplasma B cells contributed 
to the association between CS TAM polarity 
and overall B cell abundance (fig. SI6A). On 
the other hand, operationally dividing in- 
tratumoral DCs into well-defined states (34) 
showed that only the abundance of type 
1 conventional DCs (cDC1_XCRI) was asso- 
ciated with CS TAM polarity (fig. S16B). Also, 
CS TAM polarity did not associate with the 
abundance of other stromal or immune cell 
types, including TAMs themselves (Fig. 2B). 
Therefore, the quality of TAMs, not their quan- 
tity, was related to the intratumoral abundance 
of three cell types, namely T cells, B cells, and 
DCs, all of which have been linked to anti- 
tumor immunity (2). 
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(CD68) on 10 samples from the MGH/MEE-HNSCC cohort. Representative 
histology images (left) and quantification of TAM positive or not for CXCL9 
and/or SPPI (right) are shown. (H) Cox regression analysis in bulk RNA-seq cohorts 
[as in (B) and (E)] comparing the prognostic impact of TAM abundance signature 
(derived from the pseudobulk profiles of the scRNA-seq data using GLMNET 
classifiers) and the CXCL9:SPPI1 (CS) ratio. Wald 95% confidence intervals and 
P values are shown. (1) Scatter plots showing lack of correlations between 


Our scRNA-seq data further showed that 
M1 or M2 markers, in contrast to CXCL9 and 
SPP1, were not strongly associated with T cell, 
B cell, or DC abundance in the same tumors 
(Fig. 2D). Histological staining of CD8* T cells 
in tumors confirmed these findings (fig. S17). 
Overall, these data revealed the existence 
of a coordination between CS TAM polarity 
and adaptive immune responses in human 
HNSCC; they also indicated the limited link 
between traditional M1 and M2 markers and 
such immune responses. 

We extended our analyses of tumor-to-tumor 
variation in CS TAM polarity by investigating 
its connection to gene expression in all TME 
cell types. Initially, we assessed CXCL9 and 
found that not only TAMs but also tumor cells, 
fibroblasts, endothelial cells, monocytes, and 
DCs could express it, although at lower levels 
(Fig. 2E). Notably, all these cell types did so in 
a coordinated manner with TAMs. For exam- 
ple, patients with higher CXCZ9 expression in 
TAMs also had higher CXCL9 expression in 
these other cell types. In a like manner, tumor 
cells and monocytes could express SPPI at lower 
levels; they did so in a coordinated manner 
with TAMs and opposite to CXCL9 expression 
(Fig. 2E). This harmonized gene expression 
across cell types was independent of key tu- 
mor and patient characteristics, namely HPV 
or Epstein-Barr virus (EBV) status, nature of 
specimen obtained, anatomic site of tumor 
origin, sex, and smoking history (fig. $18). To- 
gether, these results suggested that CS TAM 
polarity could broadly inform on related activ- 
ities of other cells of the TME. 

All conserved minor macrophage states ex- 
pressed both CXCL9 and SPP1; they did so at 
varying levels but in a coordinated manner 
(Fig. 2F and fig. S19). For example, Mac_CXCLI0 
cells expressed CXCL9 at the highest level on 
average, but all other minor macrophage states 
expressed CXCL9 to some degree. The level of 
CXCL9 expression in Mac_CXCL10 cells varied 
between patients, but the interpatient varia- 
tion was preserved in all other minor macro- 
phage states. Analysis of other minor cell states 
indicated that the results obtained for macro- 
phages were broadly generalizable. For example, 
among monocytes, Mo_LYPD2 and Mo_CCL3L1 
expressed CXCL9 in a manner that was co- 
ordinated with each other and with that of 
macrophages; similarly, Mo_CD300LB and 
Mo_CCL3L1 expressed SPP1 in a coordinated 
manner (fig. S20A). The same findings ap- 
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the macrophage signature 


containing treatment regim 


plied to DCs: cDC1_XCR1, DC3_LAMP3, and 
cDC2/MoDC_CLECIOA all expressed CXCL9 
in a manner that was coordinated with that of 
macrophages (fig. S20B). Minor states that did 
not follow this coordination rule were gen- 
erally those in which the gene of interest was 
never expressed; for example, Mo_PADI/4, 
Mo_CD300LB, cDC2_CDIA, and pDC_LILRA4 
largely lacked CXCL9 expression in the 52 tumor 
samples analyzed. We concluded that CXCL9 
and SPPI were primarily expressed in macro- 
phages, with expression of both genes co- 
ordinated across macrophage states and, more 
broadly, across various immune and nonim- 
mune cells in the TME. 

Next, we investigated whether CS TAM po- 
larity was codependent on that of other tran- 
scripts. To this end, we again studied all TME 
cell types. We found that 2320 cell type-gene 
combinations (including 1812 individual genes) 
were significantly associated with CS TAM 
polarity [false discovery rate (FDR) < 0.05; 
table S5 and fig. S21]. The expression of a few 
of these genes, such as STATI (signal trans- 
ducers and activators of transcription 1), was 
globally shared among all immune and non- 
immune cell types (Fig. 2E). The coordinated 
expression of STATI in different cell types 
suggests the existence of coordinated pro- 
grams that can occur across the entire TME 
and correlate with CS TAM polarity; yet, cell 
type-specific programs correlating with CS 
TAM may be directly or indirectly linked to 
STAT] activity and possibly be influenced by 
STAT1-independent pathways. The expression 
of other genes was instead coordinated with 
CS TAM polarity only in certain cell types, such 
as CXCR6 in T cells and SLC2A7 (GLUTI) in 
tumor cells and macrophages (Fig. 2E). In 
addition, CS TAM polarity was directly asso- 
ciated with the expression level of a large set of 
cytokines and cytokine receptors, either posi- 
tively (i.e., genes expressed more strongly in 
cs" tumors, such as CXCL9, CXCL10, CXCLI, 
CXCL16, and IL7) or negatively (i.e., genes ex- 
pressed more strongly in cs! tumors, such as 
CXCLS5, CXCL8, ILIA, ILIB, and ILIRN) (Fig. 2G). 
The abundance of other genes, such as JFNG, 
CCL5, and CXCL13, was indirectly correlated 
with CS TAM polarity, as it depended on the 
abundance of the cell state expressing this 
gene (fig. S22). Finally, some genes were co- 
ordinated with CS TAM polarity only in certain 
minor cell types, such as JLJ2B in DC_LAMP3 


significant Fisher's exact test. 


but no other DC state (fig. S23). Overall, this 


score [(D) and (E)] and common M1 and M2 markers 


and substantial correlations between the whole signature with individual 
expression of CXCL9 and SPPI, as well as the CS ratio. Spearman's rank 
correlation is used, and, when significant, a fitted red line is shown. (J) Association 
between CS ratio and response to an anti-PD1 monoclonal antibody (mAb)- 


en in 14 patients of the MGH/MEE-HNSCC cohort, with 


coordination of expression of many genes, all 
connected to CS TAM polarity, occurred simul- 
taneously in virtually all TME cell types. 

To determine whether the genes associated 
with CS TAM polarity globally contributed to 
distinct biological processes, we performed gene 
set enrichment analyses in each cell type sep- 
arately. This allowed us to reveal not only which 
gene expression programs were associated with 
CS TAM polarity, but also in which cell types 
these programs were active (Fig. 2H). cs" TAM- 
associated pathways included immune-related 
interferon-o. (IFN-c.) and IFN-y signaling in al- 
most all cell types; interleukin-6 (IL-6), Janus ki- 
nase (JAK), and/or STATS signaling in tumor 
cells, fibroblasts, and endothelial cells; and com- 
plement activity in tumor cells, fibroblasts, and 
macrophages. By contrast, CS” TAM-associated 
pathways included epithelial-mesenchymal tran- 
sition (EMT), hypoxia, and mammalian target 
of rapamycin (mTOR) signaling in almost all 
cell types; glycolysis and angiogenesis in tumor 
cells, myeloid cells, and T cells; tumor necrosis 
factor-a (TNFa) signaling in myeloid cells and 
T cells; and transforming growth factor-B (TGFB) 
signaling in monocytes, mast cells, and T cells. 
Taken together, these data linked CS TAM po- 
larity to a variety of immune and nonimmune 
cell programs, all of which have been indepen- 
dently proposed to regulate tumor growth in a 
positive or negative manner and which we reveal 
to be deeply interconnected in human tumors. 

To identify possible cellular interactions in- 
volved in this broad program of co-regulation 
associated with TAMs, we used the well-curated 
subset of ligand:receptor pairs obtained from 
the NicheNet database (35) and only considered 
those for which the expression of the ligand 
and/or the receptor correlated with that of 
CXCL9 or SPPI in TAMs. Additionally, we man- 
ually annotated these predicted interactions a 
priori as putatively immuno-activating, immuno- 
inhibiting, or unknown on the basis of the na- 
ture of the role of the ligand or receptor reported 
in the literature (table S6). This analysis identi- 
fied multiple ligand:receptor interactions as- 
sociated with CXCL9”" TAMs, many of which 
were putatively immuno-activating and did 
not involve TAMs (Fig. 2], left). In particular, 
we found predicted interactions between DCs 
and T cells through the ligand:receptor pairs 
IL12B:ILI2RB1, CXCL16:CXCR6, XCL1:XCRI1, 
and CXCL9:CXCR3. All of these molecular in- 
teractions involving tumor-associated DCs 
have been shown to promote local antitumor 
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scRNAseq (n=52 tumors, MGH/MEE-HNSCC cohort) 
Major cell types. Patients ordered from lowest to highest CS TAM ratio. 
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scRNAseq (n=52 tumors, MGH/MEE-HNSCC cohort) 
Minor macrophage states. Patients ordered from lowest to highest CS TAM ratio. 


OXCLO SPP1 


(reads/cell) 


Me@_APOE}|*” 


Mo_SPP1 
Mo_MT1H 
Ma_MARK4 
Mo_CCL18 
Mo_CXCL10 
Mo_F13A1 
Mgo_SPP7 
Mg_MTIH 
Ma_MARK4 
Mo_CCL78 
Mg_APOE 
Mg_F13A1 


Mo_CXCL10 


Fig. 2. CXCL9:SPP1 TAM polarity determines the broader TME. (A) Patient 
ranking according to the CS TAM ratio. (B) Cell counts of major cell types. 
Patients were ranked from lowest to highest CS TAM ratio. Significant correlation 
based on Spearman's rank correlation with CS TAM is indicated with a red 
line. Each dot represents the value for one sample, with the dot size being 
representative of the cell number contributing to that value. (C) Quantitative IF 
on 23 samples showing correlation between CS TAM ratio and CD8* T cell 
abundance in both tumor nest and stroma. DAPI, 4',6-diamidino-2-phenylindole. 
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CXCL9-associated ligand:receptor interactions 
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SPP1-associated ligand:receptor interactions 


Interactions involving SPP1 
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(D) Correlations between T cell, B cell, and DC abundance (relative to TAM 
counts) and common M1 and M2 markers, CXCL9, SPP1, and CS ratio. Spearman 
rank correlation is used, and, when significant, a fitted red line is shown. (E) Cell 
type- and patient-specific expression of example genes, plotted according 

to patient order defined by CS ratio. The left panels show genes that are 
positively correlated with CS ratio, and the right panels show negatively correlated 
genes. (F) Association between CS TAM polarity and the expression of CXCL9 
(left) and SPPI (right) in minor TAM states. (G) Heatmap summarizing the 


5 of 10 


RESEARCH | RESEARCH ARTICLE 


correlation analysis [similar to (E)] when applied to 195 cytokines in all cell types 
and indicating those that are significant in at least one cell type (FDR < 0.05). 

(H) Heatmap of gene set enrichment analysis results, applied separately to each cell 
type and taking as the input the correlation with CS ratio of (G), showing significant 
MSigDB hallmark gene sets. (I) Circos plots showing well-known ligand-receptor 
pairs from curated databases (as compiled in the NicheNet software package) under 


T cell functions in experimental mouse mod- 
els (25, 36-39). We also found predicted inter- 
actions between TAMs and T cells through the 
CXCL9:CXCR3 pair. The ability of macrophage- 
derived CXCL9 to stimulate antitumor T cell 
immunity has been shown in mice (40). CXCL9 
expression in TAMs was also associated with 
putatively immuno-inhibiting CD274:PDCD1 
(PD-L1:PD1) interactions between tumor cells 
and T cells. This is consistent with our find- 
ings that tumors with cs"! TAMs were en- 
riched in IFN-y signaling and that CD274 is an 
IFN-y-inducible gene (47). Overall, the CXCL9- 
associated interaction network was strongly 
associated with inflammatory reactions and 
antitumor immunity. The interaction network 
associated with SPP?” TAMs was more lim- 
ited than that associated with CXCL9” TAMs 
and focused less on immune cell interactions 
(Fig. 21, right). SPP] was expressed in TAMs 
and tumor cells, whereas known SPPI receptors 
(integrins and CD44) were found in a variety 
of cell types, such as fibroblasts and endo- 
thelial cells, consistent with the role of SPPI 
in metastasis and angiogenesis (42). Because 
SPP1 is an extracellular matrix protein, activa- 
tion of the SPPI-associated interaction net- 
work in tumors may reflect a shift from tissue 
inflammation to tissue repair. 

To refine the above results, we further inves- 
tigated interactions at the level of minor cell 
states (fig. S24). We found that interactions of 
some ligand:receptor pairs, such as CXCL9: 
CXCR3, CXCL16:CXCR6, and CD40LG:CD40, 
were rather pleiotropic because they involved 
many different minor cell states (with inter- 
esting details, such as some minor DC states 
potentially acting as both sender and receiver 
in CXCL9:CXCR3 interactions). By contrast, other 
ligand:receptor pairs, such as XCL1:XCR1 and 
IL12B:IL12RBI1, connected only certain minor 
cellular states. For instance, XCLI:XCRI1 inter- 
actions selectively involved cDCls as a sensor 
of the chemokine XCL1, which is known to re- 
cruit cDCis to the TME and promote immune 
control of cancer (38). Similarly, JL12B:IL12RB1 
interactions selectively involved DC_LAMP3 
DCs [previously identified as DC3 (18, 34), 
mregDC (20), and LAMP3* DC (7)] as producers 
of IL-12, which can locally stimulate antitumor 
immunity (36). 

These data indicate that CS TAM polarity 
identifies critical cellular and molecular TME 
programs. These programs have been inde- 
pendently recognized for their ability to reg- 
ulate tumor growth, and we show that they are 
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all connected and identifiable by CS TAM 
polarity. 


Regulation of CXCL9 and/or SPP1 TAM polarity 


One possible explanation for the largely mu- 
tually exclusive expression of CXCL9 and SPPI 
in TAMs could be that the tissue niche pro- 
vides discrete, cell-extrinsic cues that promote 
the generation of CXYCL9* or SPPI* TAMs. Thus, 
we first investigated whether these cells dis- 
tributed differently in tumors. Histological 
analyses, combining RNA in situ hybridization 
(RNA-ISH) with immunofluorescence (IF), not 
only confirmed the CS TAM polarity of tumors 
initially identified by scRNA-seq (fig. S25), but 
also showed that TAMs accumulated often as 
clusters of either CXCL9* or SPPI' cells (Fig. 3, 
A and B, and fig. S26, A to C). The distribution 
of CXCL9* and SPPI* TAMs in the tumor stroma 
and tumor nests varied among patients, al- 
though there were increased accumulations 
of CXCL9* or SPPI* TAMs at the interface be- 
tween the tumor stroma and tumor nests in 
cs™ or csi? patients, respectively (fig. S26, D 
and E). By contrast, CKCL9” SPP TAMs were 
more found in the tumor stroma and further 
away from the interface between stroma and 
nest (fig. S26E). Additional histological analy- 
ses showed that tumor cells expressing CXCL9 or 
SPP1 were more likely to be found at the vicinity 
of CXCL9* or SPPI* TAMs, respectively (Fig. 3, 
C and D, and fig. S26F). These data align with 
the identification of a coordinated regulation of 
CXCL9 and SPPI expression between different 
cell types in the same tumors (Fig. 2E); they also 
suggest that local cues may foster the acquisi- 
tion of CXCL9* or SPPI* phenotypes. 

To examine whether distinct environmental 
signals might foster the production of CXCL9* 
or SPPI* TAMs, we characterized cellular and 
molecular neighbors of these two cell popula- 
tions. We found that CD3* and CXCR3* cells 
were enriched near CXCL9* TAMs (fig. S27, A 
and B), which validated the ligand:receptor 
analysis results suggesting CXCR3:CXCL9 inter- 
actions between T cells and macrophages (Fig. 
2I and fig. S24) and aligned with previous 
findings in mice linking CXCR3:CXCL9 sig- 
naling to intratumoral antitumor immunity 
(39). Furthermore, the neighborhood of CXCL9* 
TAMs was enriched of IENG* T cells (Fig. 3, E 
and F), which represented the majority of JFNG- 
producing cells (Fig. 3G and figs. S22 and $28). 
Conversely, the neighborhood of SPPI* TAMs 
was enriched in GLUTI' cells (Fig. 3, H and I) 
indicating hypoxic areas (43). Taken together, 


the requirement that either the ligand or the receptor (or both) is correlated with CS 
ratio in addition to both being expressed. Arrows are pointing from the ligand toward 
the receptors. Only selected example pairs are labeled and highlighted for CXCL9 
network. Tu, tumor; T, T cells; B, B cells; N, neutrophils; MC, mast cells; Mo/Mg, 
monocyte/macrophages; EC, endothelial cells; F, fibroblasts. ****P < 0.0001; 
***P < 0,001; **P <0.01; *P < 0.05. P values are adjusted for the number of cell types. 


the data indicate that CXCL9* and SPPI* TAMs 
exist in distinct neighborhoods, which con- 
tain factors that are linked to tumor control 
(T cells and IFN-y) and tumor progression (hy- 
poxia), respectively, in broad agreement with 
the results of nonspatial scRNA-seq analysis 
(Fig. 2, E and H). 

Given the physical proximity between CXCL9* 
TAMs and CXCR3* IFNG* T cells in vivo and 
the ability of IFN-y to stimulate CXCL9 expres- 
sion in other cancer settings (25), we sought to 
test in vitro whether exposing macrophages 
to IFN-y alone would promote the acquisition 
of a CXCL9* phenotype. Conversely, given the 
physical proximity between SPPI* TAMs and 
hypoxic areas identified by GLUT! positivity 
and the association between CS’? TAM polar- 
ity and hypoxia signatures, we tested whether 
hypoxia would promote the acquisition of an 
SPPI* phenotype in macrophages in vitro. To 
test these hypotheses, we first used THP-1 cells 
differentiated into macrophage-like cells (Fig. 
3J). We found that exposure to IFN-y promoted 
CXCL9 expression, whereas hypoxic conditions 
promoted SPPI expression; consequently, IFN-y 
increased the CS macrophage ratio, whereas 
hypoxic conditions decreased it. To extend these 
analyses to TAMs, we generated single-cell dis- 
sociations of primary human HNSCC biopsies 
(Fig. 3K). Again, we found that exposure to IFN-y 
promoted CXCL9 expression and increased the 
CS macrophage ratio, whereas hypoxic condi- 
tions promoted SPP] expression and decreased 
the CS macrophage ratio. These results indi- 
cate that certain microenvironmental factors, 
namely IFN-y and hypoxia, can respectively 
regulate CXCL9 and SPP] expression in TAMs. 
These results also help explain the spatial 
segregation of CXCL9* and SPPI* TAMs in 
tumors. 


Relevance of CXCL9 and/or SPP1 TAM polarity 
in a variety of cancers 


Using a pan-cancer approach exploiting bulk 
RNA-seq data from >10,000 patients (TCGA 
pooled data), we found that CS TAM polar- 
ity was strongly associated with overall sur- 
vival in this combined cohort (fig. S29A). This 
association was particularly notable for patients 
with liver, ovarian, cervical, thyroid, or bladder 
cancer, as well as for those with melanoma and 
sarcoma, in addition to patients with head and 
neck cancer as shown above (fig. S29B). These 
data confirmed that CS TAM polarity alone 
carries relevant clinical information; they also 
suggested that the relevance of this polarity 
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Fig. 3. IFN-y and hypoxia, respectively, increase and decrease CXCL9:SPP1 
TAM polarity. (A) Pseudoimage of summarized biomarker status per cell from 

a combined RNA-ISH and IF histology from a HNSCC tissue sample. Each one of the 
650,340 cells is plotted as a dot located at the center of the original cell image, 
colored according to categorized biomarker combinations. Two magnified regions 
highlighting concentration of CXCL9* or SPPI* TAMs are shown to the right. 

(B) Quantitative analysis comparing the density of the CXCL9* TAMs (top row) and 
SPP1* TAMs (bottom row) in the neighborhood surrounding every SPPI* or CXCL9* 
TAM cell (horizontal axis). The neighborhood is defined as four layers of cells 
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according to Delaunay triangulation (with ~40-um distance to the outmost layer 
and a total of ~70 cells per neighborhood). The percentage of the cell type of 
interest is compared by t test (Summarized by the fold change and P value). The 
box plots show Tukey's lower and upper hinges, the quartiles, and the median. 
Data from six samples were analyzed in the same manner. (C) Similar plot to (A), 
adding CK* tumor cells that express CXCL9 or SPPI. (D) Similar analysis to (B), 
but comparing the percentage of CXCL9* or SPPI1* tumor cells in the neighborhood 
surrounding CXCL9* or SPP1* TAMs. (E) Histology image from a HNSCC sample, 
with antibody markers for TAMs (CD68) and T cells (CD3) as well as RNA-ISH 
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probes for CXCL9 and IFNG. Areas with different CXCL9* TAMs densities are 
magnified, showing enrichment of IFNG* CD3* cells around CXCL9* TAMs. 

(F) Quantitative analysis of the histological images as in (E) from five patients 
with CS" score, showing consistent enrichment of IFNG* T cells in areas 

with high CXCL9* TAM density. (G) Percentage of CD3* cells among IFNG* 
cells. (H) RNA-ISH and IF analysis of FNG and hypoxia marker GLUT] in the 
context of SPPI* and SPPI” TAMs. (I) Quantitative analysis comparing GLUT1 
occurrence in the neighborhood of SPP1* versus SPPI” TAMs, in samples 

from three patients [with the same method as that for (B) and (D)]. (J) Impact 
of IFN-y and hypoxia on the expression of CXCL9 and SPP1 by THP-1 macrophage- 


to understanding the inner workings of the 
TME extends to many cancer indications. 

To widen our analyses to scRNA-seq data, 
we interrogated CS TAM polarity in an inde- 
pendent HNSCC cohort [7 = 18 patients (44)], 
as well as in non-small cell lung cancer (NSCLC) 
[n = 37 patients (45)] and colorectal cancer 
(CRC) [n = 62 patients (46)] cohorts. We chose 
these cohorts because they included a suffi- 
ciently large number of patients to examine 
tumor-to-tumor variations. For each cohort, we 
ranked patients according to CS TAM polarity 
and then analyzed various cell types in the 
TME. In all three newly analyzed datasets, we 
found that CXCL9 expression in TAMs was in- 
versely correlated with SPP] expression in the 
same cells (Fig. 4A), recapitulating our initial 
results obtained with the MGH/MEE-HNSCC 
cohort (Fig. IF). Therefore, the expression of 
CXCL9 and SPPI may be used to define TAM 
polarity in different cancer types. In marked 
contrast, monocytes and macrophages found 
in adjacent normal tissue were all largely CYCLO 
and SPPI (Fig. 4B), which further suggests 
that the acquisition of a CXYCL9* or SPPI* pheno- 
type requires environmental signals that may 
be absent in normal tissues. 

We extended our analysis to all cell types 
and found that in addition to TAMs, tumor 
cells and fibroblasts could also express CXCL9, 
although at lower levels. Moreover, these cells 
did so in a manner coordinated with TAMs 
(fig. S30). Tumor cells and monocytes could 
also express SPP] and did so in a way that was 
consistent with TAMs and varied inversely 
with CXCL9 expression (fig. S30). We also noted 
that fibroblasts, and to a lesser extent DCs, 
mast cells, T cells, and B cells, expressed SPPI 
in a manner coordinated with TAMs in the 
different cohorts analyzed, which suggests a 
possible extension of the synchronized expres- 
sion of CXCL9 and SPPI in the TME to these 
cells. We next investigated whether CS TAM 
polarity was codependent with that of other 
transcripts in the same patient cohorts presented 
above. We analyzed each cell type separately 
to reveal genes in each that might be coordi- 
nately expressed with CS polarity in TAMs; 
this approach identified the existence of CS 
TAM co-regulatory programs in virtually all cell 
types and in all cohorts (fig. S31). By perform- 
ing a correlation of correlations analysis, we 
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confirmed that the strong association between 
CS TAM and cell type-specific gene expression 
initially identified in the MGH/MEE-HNSCC 
cohort was largely replicated in the other three 
cohorts (Fig. 4C). In addition, CS TAM polarity 
correlated with the abundance of T cells, B cells, 
and DCs in these cohorts. The correlations were 
particularly significant for T cells and DCs in 
the CRC cohort, which contained a larger num- 
ber of patients, and for all three cell types in all 
cohorts combined (Fig. 4D). Thus, in all cancer 
indications that we were able to analyze, CS 
TAM polarity was linked to (i) the abundance 
of antitumor cell types and (ii) critical gene 
expression profiles in virtually all immune and 
nonimmune cells of the TME. 

To examine the spatial distribution of CXYCL9* 
or SPPI* TAMs in another cancer indication, 
we considered single-cell spatial transcriptom- 
ics of tissue sections obtained from lung adeno- 
carcinoma patients (47). We found that TAMs 
could accumulate as clusters of either CXCL9* 
or SPPI* cells (Fig. 4E). These data were con- 
firmed with two other approaches using the 
CellCharter algorithm (48): First, unsupervised 
identification of spatial clusters centered on 
TAMs indicated two major hubs, which were 
enriched in CXCL9* and SPPI* TAMs, respec- 
tively (fig. S32A). Second, neighborhood enrich- 
ment analysis revealed that SPP1* TAMs were 
depleted near CXCL9* TAMs and vice versa (fig. 
S32B). Furthermore, SPPI* TAMs were more 
surrounded by SLC2AI* (GLUTI) cells, whereas 
CXCL9* TAMs had more CXCR3* and IFNG* 
T cells near them (Fig. 4E and fig. S33). These 
data indicate that, irrespective of the cancer 
indication, CX¥CL9* and SPPI* TAMs may exist 
in distinct neighborhoods, which contain fac- 
tors that are linked to tumor control (T cells 
and IFN-y) and tumor progression (hypoxia), 
respectively. 

To further assess the relevance of CS polar- 
ity, we examined whether its association with 
clinical outcome was dependent on or inde- 
pendent of cancer-associated variables. These 
analyses used bulk RNA-seq data from 33 pooled 
cancer entities (n = 10,429 patients) from whom 
the clinical outcome was known. Univariate 
analyses identified that both cancer stage and 
the CS ratio were associated with clinical out- 
come (fig. S34). Furthermore, bivariate analysis 
showed that both variables remained signifi- 


like cells. Cells were exposed to 40 ng/ml of IFN-y or 1% O» for 24 hours. 
Expression of CXCL9 and SPP1 was measured by real-time quantitative 
polymerase chain reaction (RT-qPCR), and the combined CS ratio was 
calculated. Technical triplicates representative of one out of two independent 
experiments are shown. (K) Impact of IFN-y or hypoxia on the expression of 
CXCL9 and SPPI1 by TAMs obtained from two primary human HNSCC samples 
(HUG-O1 and HUG-02). The cells were obtained by enzymatic digestion of fresh 
biopsies immediately exposed to 40 ng/ml of IFN-y or 1% Oz. TAMs were 
fluorescence-activated cell sorting (FACS)-enriched after 72 hours for analysis 
of CXCL9 and SPPI1 content by RT-qPCR. 


cantly associated with clinical outcome (fig. 
S$34B). These data indicate that CS polarity 
was independent of cancer stage across cancer 
types, thereby extending our initial findings 
in HNSCC (fig. S9, D and E). Using a similar 
approach, we found that CS polarity was in- 
dependent of known EMT markers linked to 
poor prognosis (49) in both HNSCC (Fig. 4F 
and fig. S35A) and combined cancer indications 
(Fig. 4G and fig. S35B). Similarly, we compared 
how CS polarity, interferon, and effector T cell 
signatures (49) were associated with clinical 
outcome. We also considered JFNG expression 
on its own as well as CD8A expression as a proxy 
for a cytotoxic T lymphocyte (CTL) response. 
Univariate analyses identified that each vari- 
able was associated with a better clinical out- 
come in HNSCC, with the exception of JENG 
alone, whose hazard ratio was just not signif- 
icant (fig. S36A). However, bivariate analyses 
revealed that the interferon and T cell signa- 
tures were all competed by CS polarity (Fig. 4H), 
which suggests that the latter is superior and 
sufficient to identify at least some tumor- 
infiltrating lymphocyte (TIL) processes—e.g., 
those involving IFN-y responses. We extended 
these findings to all combined cancer indica- 
tions (Fig. 41 and fig. S36B), further indicating 
that CS TAM polarity alone carries meaningful 
clinical information in multiple cancer types. 


Concluding remarks 


Through a focused, clinically driven approach 
to single-cell analysis of HNSCCs, we addressed 
a major difficulty in human disease research— 
namely the very different manifestation of the 
same condition from patient to patient—and 
gained biological insight on TAM-related tran- 
scriptional co-regulation across cell types in 
the TME. Specifically, we show that cancer 
patient disease outcome is closely linked to CS 
TAM polarization, and that the latter, although 
independent of cell differentiation, exhibits 
plasticity and is strongly linked to the polar- 
ization of other immune, stromal, and tumor 
cells within the TME. CS TAM polarity identifies 
TME-wide coordination of pro- and antitumor 
pathways. The broad impact of our findings is 
highlighted in the identified relevance of this 
variable across multiple solid cancers. 

One of the conclusions of this study is that 
the CS axis is not restricted to any particular 
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Fig. 4. CXCL9:SPP1 TAM polarity identifies a coordinated network of 
cellular programs in a wide range of human cancers. (A) Scatter plot of 
CXCL9 and SPPI expression in single TAMs from three independent scRNA-seq 
datasets, including HNSCC (UPit cohort), lung, and colorectal cancer patients. 
The number of patients for each cohort is indicated above. The inserts show 
contingency tables based on dichotomized expression (positive if UMI counts are 
>3), with odds ratio and Fisher's exact test P value to indicate mutual exclusion 
(odds ratio < 1). (B) CXCL9 and SPPI expression in macrophages or monocytes, 
from tumor or adjacent normal tissue, in colorectal cancer. Healthy tissue is 
highlighted in green. (€) Comparison between cell type-specific gene expression 
correlations with CS ratio in the MGH/MEE-HNSCC cohort (fig. S21 and table S5) 
and the corresponding correlations in three independent scRNA-seq datasets, 
quantified by the correlation of correlations (P value by Spearman's rank 
correlation test). (D) Rank correlation between CS TAM polarity and relative 
abundance of T cells, B cells, and DCs (Fig. 2D) on the three independent 
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scRNA-seq datasets. P values are indicated to the right of the forest plot, and 
the 95% confidence intervals are shown as horizontal bars. The totals are 
summarized by random-effect meta-analysis. (E) Spatial transcriptomics analysis 
of CosMx lung cancer data, showing spatial separation of CXCL9* versus SPP1* 
TAMs density in the same tumor sample (left), quantified by the same analysis 
as in Fig. 3B. Enrichment of SLC2AI* cells around SPP1* TAMs and INFG* T cells 
around CXCL9* TAMs are also observed (middle and right, respectively). 

(F) Assessment of overall survival prognostic impact of bulk RNA-seq CS ratio in 
HNSCC cohorts (as in Fig. 1, B to E), simultaneously with an EMT signature 

as competing explanatory variable in Cox regression analysis. HR, hazard ratios. 
Wald 95% confidence intervals are shown as horizontal bars. (G) As in (F), 
applied to TCGA pan-cancer collection. (H) As in (F), on HNSCC cohorts 
comparing CS ratio pairwise to single-gene expression of IFNG and CD8A as well 
as the effector cells and T helper 1 (Th1) signatures from Bagaev's TME signature 
collection. (I) As in (H), applied to TCGA pan-cancer collection. 
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TAM state. Therefore, instead of differentiat- 
ing into highly polarized states, all sublineages 
of TAMs can activate the CS axis more or less 
strongly. This is consistent with the memory- 
less phenotypic plasticity that macrophages 
exhibit in vitro in response to different stimuli 
(29) and suggests that pathways associated 
with CS regulation can be transiently activated 
in many cell types in response to orchestrated 
extracellular signaling networks independent 
of intrinsic lineage differentiation programs. 
Defining whether the CS ratio can be used as a 
bona fide prognostic or predictive marker goes 
beyond the scope of this study; nevertheless, 
we have provided a simple testable hypothesis 
involving the ratio of two genes, which may be 
measured by histology or quantitative polymer- 
ase chain reaction (qPCR) analysis. We show 
that axes orthogonal to CS also exist in TAMs, 
for example those related to tissue-specific func- 
tions, and/or in other cell types; these additional 
axes require investigation and might be com- 
bined with CS to define nonredundant TME 
variables. In any case, the results presented 
here reveal that CS TAM polarity identifies a 
large co-regulated network of cellular states 
that control human cancers and is thus a sim- 
ple, but critical, determinant of the TME. 

Our results have implications for how TAMs 
should be studied, particularly in the context 
of single-cell transcriptomics. First, for the 
analysis to have clinical relevance, it should 
focus on the expression of certain genes and 
the presence of certain signaling pathways 
in TAMs rather than solely on the identifi- 
cation of TAM states and their abundances. 
Second, TAM activity should be considered 
at the patient level because it is an important 
factor to explain transcriptomic heterogeneity 
as well as clinical outcome. Third, the patient- 
level approach also permits the study of co- 
regulation of various genes in different cell 
types and intercellular communication in the 
TME. These considerations have implications 
not only for better understanding of the TME 
at the time of cancer diagnosis but also for 
studying the efficacy of TAM-targeting thera- 
pies or other cancer treatments that may in- 
fluence or be influenced by TAMs. 

Considering the TME as an ecosystem of in- 
terdependent elements, perturbation of any of 
them should have circular effects. For example, 
our results showing IFN-y-induced CXCL9 (and 
hypoxia-induced SPP) expression by macro- 
phages suggest that CS TAM polarity is a con- 
sequence of signals from the TME. However, 
the resulting TAM phenotypes could contribute 
to the amplification of TME features, as indi- 
cated by the acquisition of distinct hallmark 
gene programs in TAMs in cs and CS” tumors. 
Thus, TAM polarization is likely to be both a cause 
and a consequence of the TME composition, 
consistent with the notion that macrophages 
are plastic cells that tailor their responses to 
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their environment (50). By extension, the entire 
network associated with CS TAM polarity could 
be induced or disrupted by multiple therapeu- 
tic means. Overall, insights gained from this 
work could help dissect the TME in various 
indications and have broad implications for 
our understanding of cancer biology and the 
development of precision medicine. 
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Mechanistic snapshots of rhodium-catalyzed 
acylnitrene transfer reactions 


Hoimin Jung’, Jeonguk Kweon*, Jong-Min Suh’, Mi Hee Lim’, Dongwook Kim?"*, Sukbok Chang”?* 


Rhodium (Rh) acylnitrene complexes are widely implicated in catalytic C-H amidation reactions but have 
eluded isolation and structural characterization. To overcome this challenge, we designed a chromophoric 
octahedral Rh complex with a bidentate dioxazolone ligand, in which photoinduced metal-to-ligand charge 
transfer initiates catalytic C-H amidation. X-ray photocrystallographic analysis of the Rh-dioxazolone 
complex allowed structural elucidation of the targeted Rh-acylnitrenoid and provided firm evidence that 
the singlet nitrenoid species is primarily responsible for acylamino transfer reactions. We also monitored 
in crystallo reaction of a nucleophile with the in situ-generated Rh-acylnitrenoid, which provided a 
crystallographically traceable reaction system to capture mechanistic snapshots of nitrenoid transfer. 


dentifying the structural and electronic 

nature of reactive intermediates in cata- 

lytic processes is essential for elucidating 

the reaction mechanisms and subsequent- 

ly designing new catalysts with improved 
efficiency and selectivity. Such an endeavor 
has been one of the central research focuses 
in the field of nitrene and carbene chemistry, 
wherein metal-nitrenoid and carbenoid spe- 
cies are regarded as key intermediates in 
transition metal-catalyzed hydrocarbon func- 
tionalizations (Fig. 1A) (7-5). Despite major 
advances in application of this valuable trans- 
formation to aryl, alkyl, and sulfonyl-nitrene 
precursors (6-9), experimental observation of 
the reactive metal-nitrenoids had long remained 
a challenge because of the highly transient 
nature of these intermediates. Nonetheless, 
by relying on the several precedents of metal- 
imido complexes featuring stabilized metal- 
nitrogen multiple bonding interactions (10-16), 
systematic ligand design combined with mod- 
ern analytical technologies has enabled the 
spectroscopic detection (17-20) or isolation of 
certain late transition metal-based nitrenoids 
(21-24). 

One of the key strategies that proved fruitful 
in this demanding task involves ligand engi- 
neering to control the coordination sphere and 
stabilize the target complexes. For instance, 
Betley and colleagues succeeded in the isola- 
tion of a copper-nitrenoid species by using 
sterically encumbered dipyrrin ligands and 
rationalized it as a copper(I)-supported triplet 
aryl nitrene species (25). Munz and colleagues 
highlighted access to reactive palladium(II)- 
sulfonylimido (26) and palladium(0)-aryl/ 
alkylnitrene species (27) with potential cata- 
lytic applications. In addition to these synthe- 
tic approaches, Powers and co-workers observed 
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the photoinduced formation of an Rh,-nitrenoid 
species along with released N. from the cor- 
responding Rh,-azide complexes through single- 
crystal x-ray diffraction (28, 29). Since these 
elegant achievements, various transition metal- 
based nitrenoids, nitrides, or metallonitrenes 
have been reported by using the x-ray photo- 
crystallographic approach with metal-azides 
or metal-azido precursors (30-32). 

Despite the aforementioned progress that 
enabled the identification of metal-nitrenoid 
intermediates, the related investigations on 
acylnitrenoid congeners have lagged behind, 
primarily because the N-acyl substituent 
markedly destabilizes the nitrenoid species 
and engenders various decomposition path- 
ways. Only a handful of catalytic systems have 
been reported by using traditional N-acylamino 
sources such as acyl azides (33), hydroxycar- 
bamate derivatives (34), or halocarbamates (35), 
which often suffer from capricious thermal and 
photochemical properties (36, 37). More recently, 
1,4,2-dioxazol-5-ones, so-called dioxazolones, 
have gained broad attention in the synthetic 
organic community as a more user-friendly yet 
highly reactive N-acylamino source, easily 
accessible from abundant carboxylic acid feed- 
stocks, that releases CO, as the only by-product 
(Fig. 1B) (38). Given these attractive features, 
Bolm’s pioneering work on sulfur imidation 
by using ruthenium photocatalysis (39) as well 
as our C-H amidation using half-sandwich, 
d transition metal catalysts (38) sparked the 
use of dioxazolones as a practical and versatile 
N-acylamino precursor. Indeed, the wide util- 
ity of dioxazolones is reflected in their appli- 
cation as reagents by more than 120 research 
groups since 2014 (40, 42), which thereby re- 
sulted in the invention of diverse synthetic meth- 
odologies for the installation of N-acylamino 
fragments that have been recognized as an 
important scaffold in clinically approved phar- 
maceuticals and bioactive natural products (42). 

Despite these synthetic advances, there is 
still a lack of mechanistic probes to validate 
the intermediacy of metal-acylnitrenoids, which 


bea i updates 
observed reactivity outcomes obtained sliecatesaa) 


the inner- (38, 43, 44) and outer-sphere C-H 
amidation (45-48), and reaction with an ex- 
ternal nucleophile (49). Moreover, the mode of 
dioxazolone activation to access such interme- 
diates is also not certain, although a simple metal- 
nitrogen coordination and decarboxylation se- 
quence is the generally accepted mechanism (38). 

To this end, we now present a chromophoric 
rhodium complex that led us to tackle these 
long-standing challenges (Fig. 1C). The photo- 
crystallographic experiments of this octahedral 
Rh complex that incorporates a bidentate di- 
oxazolone enabled structural characterization 
of a formal Rh-acylnitrene intermediate, which 
was identified as an active intermediate for 
photocatalytic C-H amidation reactions. In 
addition, we accumulated support for the 
hypothesis that the observed nitrenoid species 
is equivalent to the reactive catalytic interme- 
diate accessed in the solution-phase reactions 
because we demonstrated sequential bond- 
breaking and forming events in the crystalline 
matrix by means of Rh-acylnitrenoid transfer 
to a cocrystallized external nucleophile, which 
thus provides a crystallographically trackable 
solid-state reaction system to observe fleeting 
catalytic intermediates. 


Access to Rh-dioxazolone complexes and their 
photochemical properties 


Because dioxazolones have found a wide range 
of applications in acylnitrenoid transfer reac- 
tions, it is crucial to validate their intermedi- 
acy and gain a better understanding of their 
electronic properties. However, numerous at- 
tempts to isolate or even spectroscopically 
detect the putative metal-acylnitrenoid inter- 
mediates failed continuously in previous sys- 
tems (47). One of the critical difficulties was 
the synthesis of transition metal-bound dioxa- 
zolone adducts, which was plagued by the low 
binding affinity of dioxazolones to the metal 
center (38). Preventing the rapid decomposi- 
tion of metal-acylnitrenoids such as the Curtius- 
type intramolecular rearrangement was deemed 
another challenge (Fig. 2A). Hence, a deli- 
cately designed catalytic system was required 
to enable the formation of transition metal- 
bound dioxazolone precursors and selectively 
trigger their subsequent decarboxylation into 
the posited metal-acylnitrenoid species in a 
controlled manner. 

To selectively induce decarboxylation, we 
prepared a photoresponsive octahedral rho- 
dium complex with a bidentate dioxazolone 
ligand. This approach was inspired by Meg- 
gers’ work on a bis-cyclometalated rhodium 
catalyst system bearing benzothiazole ligands 
for intermolecular [2+2] photocycloadditions 
(50). We believed that solid-state photocrystal- 
lography of our Rh-dioxazolone complexes 
could finally access the long-sought nitrenoid 


lofs8 


RESEARCH | RESEARCH ARTICLE 
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Fig. 1. Characterizing the intermediacy of transition metal-nitrenoids in catalytic C-H amination. (A) C—H amination via a nitrenoid transfer pathway. IPh, 
phenyliodinane; R-NHOR', hydroxylamine derivatives. (B) Dioxazolones as metal-acylnitrenoid precursors. Nu, nucleophile. (€) Strategic design of chromophoric 
Rh-dioxazolone complexes to detect and characterize Rh-acylnitrenoids that enable photocatalytic C-H amidation. hv, light; Me, methyl. 


intermediate: Structural confinement in the 


approach, we subjected the obtained Rh- 


ways of [Rh2]. The simulations predicted that 


crystalline matrix would prevent undesired trans- 
formations and enable in situ monitoring of the 
crystal-to-crystal conversion. With a hypothesis 
that the presence of a pyridinyl substituent would 
improve the binding affinity of dioxazolones 
to the metal center, we attempted to synthe- 
size the designed complexes that might exhibit 
the desired light-harvesting property (Fig. 2B). 
We successfully obtained cationic rhodium- 
bound dioxazolone complexes [Rh2]PF¢ (PF, 
hexafluorophosphate) and [Rh2]BArF (BArF, 
tetrakis[3,5-bis(trifluoromethyl)phenyl Jborate) 
from dioxazolone (1) and the corresponding 
bis-cyclometalated rhodium(I) complex PRHITX 
in dichloromethane. 

To verify the light-absorption properties 
required for the designed photocrystallographic 
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dioxazolone complex [Rh2]BArF to ultra- 
violet-visible (UV-Vis) spectroscopic analysis 
(Fig. 2C). [Rh2]BArF showed notable ab- 
sorption in the blue to UV-A regime, where- 
as a metal-free dioxazolone 1 displayed only 
marginal absorption. The absorption charac- 
ter at the near-UV region of [Rh2]BArF was 
attributed to metal-to-ligand charge transfer 
(MLCT) from the Rh-d orbital to the n* or- 
bital of the dioxazolone ligand (1) on the basis 
of time-dependent density functional theory 
(TDDFT) simulation (detailed in the supplemen- 
tary materials, section 7-2). To gauge whether 
this MLCT transition could indeed result in 
the formation of Rh-nitrenoids, we conducted 
further density functional theory (DFT) simula- 
tions to investigate the potential activation path- 


upon photoexcitation of singlet *[Rh2] to its 
triplet MLCT state 3[Rh2], 51.1 kcal/mol higher 
in energy, an asynchronous decarboxylation can 
proceed to form a triplet nitrenoid 3[Rh3] with 
a low barrier of 1.1 kcal/mol (Fig. 2D). By contrast, 
a concerted decarboxylation process from the 
singlet "[Rh2] displays a higher barrier of 
24.2 kcal/mol to furnish the corresponding 
singlet nitrenoid *{>Rh3]. The predicted energy 
difference between the triplet and singlet acyl- 
nitrenoid [Rh3] is only 1.7 kcal/mol. 


Photoinduced crystallographic access to an 
Rh-acylnitrenoid intermediate 


Having established the photolabile yet isolable 
Rh-dioxazolone system [Rh2], we were interested 
in accessing the sought-after Rh-acylnitrenoid 


2 of 8 


RESEARCH | RESEARCH ARTICLE 


Fig. 2. Access to 


A Challenges in isolating M-acyInitrenoids 


Rh-dioxazolone $ 
complexes and their | 2 oe Le X I Zz - Le 
photochemical _ te Px > = Vien ) SNES i oy | OM==N ee NES 
properties. (A) Key 2) ES Ro NZ =iL “| -Ol_.x | 4] =e Z| ' Da’) ae ee 
challenges in isolating acylnitrene precursors R re i Sep eat ' R isocyanate 
M-acylnitrenoids. (B) Synthe- challenge 1: low M-N binding affinity challenge 2: metal-acylnitrenoid instability 
sis of Rh-dioxazolone (difficult synthesis of M-dioxazolone precursors) (rapid decomposition) 
complexes [Rh2]X. : . 
(C) UV-Vis absorption B Synthesis of Rh-dioxazolone complexes C UV-Vis study 
spectra of 1, [Rh1]BArF, 5 
and [Rh2]BArF in CH2Clo < ~] a7 xX = mr % = a —— Dioxazolone (1) 
(500 uM). a.u., arbitrary unit. N-f O r, N~1 O ~ = eae 
(D) DFT-computed Gibbs Cy | 7 Me o“¥ \\ | jad 3 
energy profile of the “te dN NvO . = ee aS? 8 
decarboxylation step of Rh CHCl, WV 4 
[Rh2]; B3LYP-D3/cc-pVTZ OY | Ns Le | Lz g 
(-f)/LACV3P//B3LYP-D3/ ne-N Me n-N 
6-31G**/LACVP (e = 8.93 L» 1 i. 
for CH2Clz). MeCN, or [Rh1]x [Rh2]X (>95%) aie Lara ace 
300 320 340 360 380 400 420 440 460 480 500 


NCMe, acetonitrile; TS, 
transition state. 


AG(sol) 
(kcal/mol) 


(X = PF or BATF) 


crossing (ISC) 


J 
[Rh1] +1 
(9.14) 
9 -2MeCN 
® o-4 
hoNeMe |, fo TRn2] | 0 
NCMe yA (0.00) | ® \_o 
\A 
[Rh1] NSS 
1 Z 


intermediates through photoirradiation. 
However, all our attempts to capture these 
fleeting metal-acylnitrenoid intermediates in 
the solution phase proved unsuccessful. 
Consequently, we envisaged observing the 
Rh-acylnitrenoid species through an x-ray 
photocrystallographic approach (28, 29), given 
that [Rh2] shows a low barrier for photo- 
chemical activation. In the ensuing efforts to 


Jung et al., Science 381, 525-532 (2023) 


4 August 2023 


monitor the in crystallo formation of Rh- 
acylnitrenoids by using [Rh2]BArF crystals 
at a temperature of 100 K and photoexcitation 
with an external light source (370 nm), we were 
able to observe the formation of composi- 
tionally disordered Rh-acylnitrenoid along with 
an extruded CO, molecule [Rh3](BArF)(CO,) 
in the crystalline matrix (Fig. 3A; see fig. S16 


1TRh2-TS] 
(24.18) 


for the experimental setup). 
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The progress of this in crystallo conversion 
could be monitored by the sequential collection 
of x-ray diffraction data over time with continu- 
ous irradiation with a light-emitting diode (LED) 
light, and 33% of Rh-dioxazolone [Rh2](BArF) 
was converted into Rh-acylnitrenoid with ex- 
truded CO, [Rh3](BArF)(CO,) after 30 min 
(see the supplementary materials for the rel- 
ative conversion of crystal over time; fig. S19). 
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Fig. 3. In crystallo access of Rh-acylnitrenoid. (A) Formation of [Rh3](BArF)(COz2) from [Rh2]BArF through x-ray photocrystallography. In the presented 
structure of [Rh3](BArF)(COz), only disordered Rh-acylnitrenoid and released CO> structures are shown. The hydrogen atoms and disordered CF3 
groups were omitted in the crystal structures for clarity. (B) MALDI-MS evidence for COz loss from [Rh2]BArF upon photoirradiation. calcd, calculated: 


exp, experimental; sim, simulated. 


Upon photoinduced decarboxylation, the 
Rh-NI1 bond length was notably contracted 
from its original length of 2.153(2) A (Rh- 
Nia in [Rh2]BArF) to 2.02(2) A [Rh-Nib in 
[Rh3](BArF)(CO,)]. Of note, the observed 
bond length of Rh-Nib is substantially longer 
compared with the reported terminal Rh-alkyl 
imido complex [1.780(2) A] (15), yet it is in a sim- 
ilar range with the photocrystallographically 
accessed Rho-alkyl and Rh,-aryl nitrenoids [2.12 
(2) and 2.055(4) A, respectively] (28, 29). By con- 
trast, the bond lengths of Nia-Cla [1.305(3) A] 
and Cla-Ola [1.348(3) A] of [Rh2]BArF were 
not noticeably altered [d(N1b-Cib) = 1.30(3) 
A and d(Clb-O1b) = 1.34(3) AJ, which suggests 
the formal double-bond character of N1b-Clb 
and single-bond character of Clb-Olb. This struc- 
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tural aspect would be attributed to the de- 
localization of the double-bond character of 
amide fragments (24). 

Also, the bond lengths of the trapped CO, 
molecule in [Rh3](BArF)(CO,) [d(C3b-O2b) = 
1.07(2) A and d(C3b-O3b) = 1.16(2) A] were 
consistent with the previously reported C=O 
bond length in the crystal structure of CO, 
(1.155 A) (51). The extruded CO, is positioned be- 
tween the two CF; groups of the BArF counter- 
anion, which thus implies that the hydrophobic 
and bulky character of the counter-anion plays 
a beneficial role in capturing the released CO 
in the crystal structure. The current photocrystal- 
lographic observation bolsters the proposed 
mechanism of dioxazolone activation through 
decarboxylation upon metal-dioxazolone coor- 


dination, which thereby rules out an alterna- 
tive pathway involving oxidative addition of 
the N-O bond of dioxazolones (52). 

We also probed the photoactivation of Rh- 
dioxazolone complexes by matrix-assisted laser 
desorption/ionization mass spectrometry 
(MALDI-MS) measurements (Fig. 3B and 
fig. S9). Upon laser irradiation (355 nm), the 
['*N-Rh2]BArF complex produced a peak 
in the mass spectrum [mass/charge ratio (7m/z) = 
509.03] that corresponded to a decarboxylated 
cationic Rh-acylnitrenoid species [**N-Rh3] 
(calculated m/z = 509.0592). Under otherwise 
identical conditions with isotopically N-labeled 
(°N-Rh2]BArF, the resulting peak assigned 
as [°N-Rh3] (m/z = 510.05) displayed a 1-Da 
shift from ["*N-Rh3], which indicates that the 
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observed [Rh3] ion contains the N-labeled 
nitrogen atom originating from the dioxazolone 
ligand. In addition, the isotope distribution of 
both peaks was consistent with the simulated 
mass spectra of [°*N-Rh3] and [2°N-Rh3]. 


Exploration of photoinduced catalytic 
C-H amidation 


Encouraged by the successful trapping of the 
Rh-acylnitrenoid species [Rh3] by means of 
x-ray photocrystallography, we sought to ob- 


serve its reactivity toward hydrocarbon sub- 
strates (Fig. 4A). Upon 370-nm irradiation of 
the mixture of [Rh1]PFg and dioxazolone 
1 in dichloromethane with benzene as a sub- 
strate, quantitative C(sp”)-H amination was 
observed (([Rh4]PF¢). Similarly, stoichiomet- 
ric C(sp*)-H amination was also successful with 
cyclohexane as substrate (TRh5]PF¢, 72%), and 
the structures of the obtained products were 
confirmed by x-ray crystallographic analysis 
(tables S7 and S8). Irradiation of [Rh1]PF¢ 
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and dioxazolone 1 in dichloromethane only 
without hydrocarbon coupling partners led 
to the decomposition. 

This observation of the photoinduced stoi- 
chiometric acylnitrene transfer reactivity of 
the current Rh-dioxazolone system led us to 
examine the feasibility of a catalytic C-H ami- 
dation. Indeed, by using [Rh1]BArF as a 
catalyst, various arenes could be function- 
alized to produce aniline derivatives under 
the photochemical reaction conditions (Fig. 4B). 
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Fig. 4. Photochemical reactivity of Rh-dioxazolone complexes. (A) Photo- 
chemical, stoichiometric reactivity of [Rh1]PF, and 1. Reactions were conducted 
in 0.05-mmol scale in CH2Clz (0.02 M, 2.5 ml) and substrate (0.5 ml) at 

room temperature under 370-nm irradiation for 1 hour; yields were based on 
1H NMR analysis of the reaction mixture by using an internal standard 
(dibromomethane). (B) Expansion to photocatalytic C-H amidation. Unless 
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otherwise indicated, reactions were run with 10 mol % of [Rh1]BArF catalyst 
in CH2Clo (0.1 M) at room temperature under 427-nm irradiation for 12 hours; 
the data are reported as percent isolated yields. *20 mol % [Rh1]BArF 
catalyst was used. ‘CHClo/substrate = 2:1 (v/v) mixture was used. HO, 
hydroxy; MeO, methoxy; MeO2S, methylsulfonyl; rt, room temperature; SCXRD, 
single crystal x-ray diffraction. 
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Fig. 5. Mechanistic studies of the Rh-acylnitrene transfer to C-H bonds. (A) C-H amidation toward a substrate with a chiral center. e.e., enantiomeric excess; 
e.r., enantiomeric ratio. (B) Intermolecular KIE. (©) DFT computation results on two potential mechanisms of C-H amidation from [Rh3]; B3LYP-D3/cc-pVTZ(-f)/ 


LACV3P//B3LYP-D3/6-31G**/LACVP (e = 8.93 for CH2Clz). (D) CASSCF(16e,120)-level calculations on DFT-optimized structure of *"[Rh3]. Numbers in parentheses 


indicate Lowdin orbital population. 


Irradiation of dioxazolone (1) and 10 equiv 
of benzene in dichloromethane gave 71% of C 
(sp”)-H amidation product 2 in the pres- 
ence of 10 mol % of [Rh1]BArF. The 'N- 
labeled aniline **N-2 could also be obtained 
in 70% yield by using the isotopic dioxa- 
zolone. p-Xylene was selectively converted to 
C(sp”)-H amidation product (3, 43%) with- 
out observing an amination at the C(sp*)-H 
bonds. This protocol was applicable to arenes 
containing electron-donating (OMe, 4) or 
electron-withdrawing groups (Cl, 5). A 1,4- 
disubstituted arene carrying both electron-rich 
and deficient groups was found to be amidated 
favorably at the C(sp”)-H bond adjacent to 
the electron-donating substituent, and the 
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structure was characterized by x-ray crystal- 
lography (6, 50%). When phenol was subjected 
to the reaction, the ortho-C-H bond was se- 
lectively amidated to yield 7 in 66% yield. 
When 1,2-dichlorobenzene was used as the sub- 
strate, a separable regioisomeric mixture of 
amidated products was obtained (8 and 9). 
The current photochemical protocol was 
also active toward selective C(sp®)-H amida- 
tions, as demonstrated in the reactions with 
cyclohexane (10, 69%) and cyclopentane (11, 
46%), yet required increased catalyst and sub- 
strate loading. Intriguingly, when testing ada- 
mantane as the substrate, tertiary C(sp®)-H 
amidation (12) was favored over the secon- 
dary counterparts (13), with the statistical 


3 


selectivity ratio of 3°/2° C-H amidation 
being ~8:1 under the current photochemical 


conditions. 


Mechanistic considerations for the 
acylnitrenoid transfer to C-H bonds 


With the successful validation of catalytic re- 

activity as presented in the above brief reaction 

scope, we next sought to investigate the electronic 
structure and spin state of Rh-acylnitrenoids 
pertinent for the catalytic C-H amidation re- 
actions. By considering the low triplet and 
singlet energy gap of [Rh3] [AGC[Rh3]) - 
AG(?[Rh3]) = 1.7 kcal/mol] (Fig. 2D) and the 
high spin-orbit coupling tendency of second-row 
transition metal complexes, we hypothesized 
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Fig. 6. Photoinduced formation of Rh-dioxazole from Rh-dioxazolone. (A) Solution-phase LED-NMR monitoring of the formation of a dioxazole-bound complex [Rh6-dg]PF. 
from [Rh2]PF¢. CDs, trideuteriomethyl. (B) DFT-computed Gibbs energy profile on the formation of Rh-dioxazole species [Rh6] from Rh-nitrenoid [Rh3]. B3LYP-D3/cc-pVTZ(-f)/ 
LACV3P//B3LYP-D3/6-31G**/LACVP (e = 20.7 for acetone) (C) In crystallo monitoring of [Rh2](PF.)(acetone) to [Rh6]PF¢. In the presented structure of [Rh6]PF.¢, only 


the disordered Rh-dioxazole structure is shown. The hydrogen atoms and disordered PF, were omitted in the presen 


that both singlet and triplet nitrenoids could 
be energetically accessible under our photo- 
chemical conditions. 

To extract the electronic feature of the rho- 
dium nitrenoid, some elucidative mechanistic 
probes were examined under catalytic con- 
ditions. When a substrate that contained an 
enantioenriched tertiary C(sp*)-H center was 
subjected “to the current reaction conditions,” 
complete retention of the stereochemistry was 
observed in the aminated product (R14 (Fig. 5A). 
This result suggests that the reaction proceeds 
via a concerted C-H insertion pathway that 
involves a singlet nitrenoid *[Rh3] because 
triplet nitrenoid is known to react through a 
hydrogen atom abstraction (HAA) that would 
lead to a loss of stereochemical integrity (53). 
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In addition, an intermolecular competition ex- 
periment between cyclohexane and cyclohexane- 
diz substrates yielded a kinetic isotope effect 
(KIE) value of 2.13 at room temperature (Fig. 
5B), which is noticeably lower than the typical 
KJE range of ~8 to 11 obtained from the HAA/ 
radical rebound mechanism (54). Although we 
cannot entirely rule out the possibility of HAA 
and fast radical recombination at this stage, 
computational studies on the C-H amidation of 
[Rh3] with cyclohexane as the model substrate 
indicated that the insertion barrier on the sin- 
glet surface is 2.8 kcal/mol lower than that of 
the HAA pathway on the triplet surface (Fig. 5C). 

Although the photoactivation of dioxazo- 
lones is initiated on the triplet surface (Fig. 
2D), the above experimental mechanistic probes 
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ited crystal structures for clarity. 


strongly suggest that the observed reactivity 
toward the subsequent C-H amidation is pro- 
moted instead by a singlet Rh-acylnitrenoid 
1[Rh3] after spin crossover. To gain a deeper 
understanding of the electronic structure of 
the proposed Rh-acylnitrenoid species, we 
conducted complete active space self-consistent 
field (CASSCF) calculations on the DFT-optimized 
structure of *{Rh3]. As illustrated in Fig. 5D, 
the CASSCF(16e,120) computation gives a lead- 
ing configuration (81%) of Rh(IID)-acylnitrene 
character, whereas a minor contribution (7%) 
of Rh(V)-acylimido character coexists as a multi- 
configurational state of Rh(II])-acylnitrene. In 
the leading subspace, the presence of three 
sets of Rh-centered occupied orbitals [d, and 


dy, orbitals and one Rh(d,,)-N(p,) m-bonding 
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orbital] and one empty N-centered Rh(d,,)- 
N(px) x*-antibonding orbital suggests a dg Rh 
(II1})-acylnitrene configuration. Noteworthy 
is that the computed L6wdin bond order of 
Rh-N (1.19) indicates the existence of weak 
multiple bonding interaction between the Rh 
metal center and the nitrenoid unit. 


Crystallographic monitoring of Rh-acylnitrene 
transfer toward external nucleophiles 


On the basis of the rationale outlined above 
regarding the involvement of singlet Rh- 
acylnitrene species in the present photo- 
induced C-H amidation, it was anticipated 
that this intermediate would also exhibit elec- 
trophilic reactivity toward external nucleophiles 
(49). Indeed, such reactivity was successfully 
showcased by a solvolysis-type reaction of 
Rh-dioxazolone complexes in acetone solvent 
(Fig. 6A). When monitoring the progression of 
[Rh2]PF¢ photolysis in acetone-dg by using 
a 365-nm LED-nuclear magnetic resonance 
(NMR) setup, we observed a stoichiometric 
transformation of Rh-dioxazolone into an 
acetone-incorporated Rh-dioxazole complex 
([Rh6-d,]PF.¢) in 75% yield upon 20 min of 
irradiation. This reactivity can be ascribed to 
a sequential process of initial nitrenoid for- 
mation, followed by a reaction with acetone 
as a formal O-nucleophile. DFT study of this 
event revealed that the formation of dioxazole 
[Rh6] can be best described as a stepwise 
N-O bond formation, followed by a ring-closing 
C-O bond formation with an overall barrier 
of 11.0 kcal/mol (Fig. 6B). By contrast, on the 
triplet energy surface, the analogous N-O 
bond-forming event requires a much higher 
barrier of 42.7 kcal/mol (fig. S14), which thus 
indicates that the singlet nitrenoid *=[Rh3] 
would be solely responsible for this process. 

Although the combined experimental and 
computational results clearly demonstrate the 
intermediacy of Rh-acylnitrene in the solution- 
phase photochemical reactions of both C-H 
amidation and acetone incorporation, we were 
curious to explore whether the nitrene trans- 
fer could also be monitored in the solid state. 
We were able to obtain cocrystals of the Rh- 
dioxazolone complex with an acetone mole- 
cule [Rh2](PF,)(acetone) from [Rh1]PF, 
and dioxazolone 1 in cosolvent of acetone and 
cyclohexane. Intriguingly, in the crystal struc- 
ture of [Rh2](PF,)(acetone), the acetone 
molecule is positioned close to the dioxazo- 
lone core, with a distance of 2.695 A between 
the oxygen atom of acetone and the centroid 
of the dioxazolone ring (Fig. 6C). 

An x-ray photocrystallographic experiment 
was subsequently performed on [Rh2](PFg) 
(acetone). Notably, upon photoirradiation at 
135 K (370 nm), the single-crystal x-ray dif- 
fraction data revealed that the cocrystallized 
acetone was indeed incorporated into the Rh- 
acylnitrenoid generated in situ, which led to 
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19% of compositionally disordered dioxazole 
moiety ((Rh6]PF¢) after 30 min. During this 
photoinduced in crystallo conversion, two 
bonds of the dioxazolone motif were broken 
while two new bonds were formed, producing 
the dioxazole scaffold. This crystal-to-crystal 
transformation involves a series of mecha- 
nistic steps that include the generation of a 
highly transient intermediate and then inter- 
molecular capture by an external molecule, 
which all occur in the solid state. Of note, 
unlike the case of [Rh3](BArF)(CO,), the 
extruded CO, was not detected in the crystal 
structure of [Rh6]PFg. Furthermore, this 
crystalline reactivity is in line with our exper- 
imental results observed in the solution phase 
(Fig. 6A). In addition to the intrinsically highly 
electrophilic nature of the in situ-generated 
Rh-nitrenoid, the photochemical formation of 
[Rh6]PF, even in the solid phase can be at- 
tributed to the close distance between the ace- 
tone and dioxazolone core in the crystal 
structure of [Rh2](PF,)(acetone). We be- 
lieve that such monitoring of the reaction 
progress in the crystalline matrix is an addi- 
tional compelling mechanistic snapshot that 
validates the Rh-acylnitrenoid intermediacy in 
the acylamino group transfer reactions. 
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Malaria control demands the development of a wide range of complementary strategies. We describe 
the properties of a naturally occurring, non—genetically modified symbiotic bacterium, Delftia 
tsuruhatensis TC1, which was isolated from mosquitoes incapable of sustaining the development 
of Plasmodium falciparum parasites. D. tsuruhatensis TC1 inhibits early stages of Plasmodium 
development and subsequent transmission by the Anopheles mosquito through secretion of a small- 
molecule inhibitor. We have identified this inhibitor to be the hydrophobic molecule harmane. We 
also found that, on mosquito contact, harmane penetrates the cuticle, inhibiting Plasmodium 
development. D. tsuruhatensis TC1 stably populates the mosquito gut, does not impose a fitness 
cost on the mosquito, and inhibits Plasmodium development for the mosquito’s life. Contained field 
studies in Burkina Faso and modeling showed that D. tsuruhatensis TC1 has the potential to 
complement mosquito-targeted malaria transmission control. 


pproximately half of the world’s popu- 
lation live at risk of malaria. In 2021, an 
estimated 247 million malaria cases in 
84 endemic countries resulted in 619,000 
deaths, primarily children under the age 
of five (1). Little progress has been made in 
recent years to reduce the burden of malaria 
(1), emphasizing the urgent need to develop 
and implement new control strategies. 
Anopheline mosquitoes are obligatory vectors 
for transmission of Plasmodium, the causative 
agent of malaria. Whereas large-scale distribu- 
tion of long-lasting insecticide bednets and 
indoor residual spraying have been successfully 
used to reduce mosquito populations (2-8), 
insecticide resistance is gradually diminish- 
ing the effectiveness of these approaches (9). 
In addition, parasite resistance to leading anti- 
malaria drugs, such as artemisinin-based 
combination therapy, is compromising the 
effectiveness of malaria treatment (10, 11). 
Alternative strategies such as those aimed at 
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targeting the malaria parasite within the mos- 
quito could avoid selection of such resistances. 

Here we identify and characterize a symbiotic 
bacterium, Delftia tsuruhatensis TCl, isolated 
from mosquitoes of an Anopheles stephensi 
colony that had lost the ability to sustain in- 
fection of Plasmodium falciparum parasites. 
D. tsuruhatensis TC1 secretes a small hydro- 
phobic methylated B-carboline called harmane 
that inhibits parasite development when se- 
creted in the mosquito gut, as well as on con- 
tact when it penetrates the mosquito cuticle. 
Data from preliminary contained field studies 
in Burkina Faso and mathematical model- 
ing support the translational potential of 
D. tsuruhatensis TC1 to control malaria in an 
endemic setting. 


Bacterial colonization of the anopheline midgut 
and inhibition of Plasmodium development 


An An. stephensi mosquito colony at the GSK 
R&D, Tres Cantos insectary (Spain), was observed 
to gradually lose susceptibility to P. falciparum 
infection with no apparent loss of fitness. 
To investigate factors affecting this loss of 
susceptibility, we examined microbiota from 
mosquitoes and breeding environments. 
D. tsuruhatensis was predominantly found in 
larvae, pupae, and adult mosquito midguts, 
as well as in water from larval breeding pans. 
D. tsuruhatensis TC1 was isolated from mos- 
quito midguts, and an initial proof of principle 
study confirmed that this bacterium was re- 
sponsible for reducing parasite numbers in An. 
stephensi mosquitoes at the GSK Tres Cantos 
facility (12) (details in the materials and meth- 
ods section of the supplementary materials). 
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mosquito vectors, this bacterium was admin- 
istered to female Anopheles gambiae mosqui- 
toes of the Johns Hopkins colony by means of 
cotton balls soaked with a bacteria and sugar 
mixture, followed by a Plasmodium-infected 
blood meal. Although D. tsuruhatensis TC1 
colonizes mosquito guts in small numbers (<20 
bacteria per midgut), this number increased 
by about 100-fold after a blood meal (table 
$1). Colonization efficiency was high, as bacte- 
ria were present in almost all mosquitoes. 
D. tsuruhatensis TC1 strongly inhibited P. fal- 
ciparum (73%) and P. berghei (47%) oocyst for- 
mation (Fig. 1, A and B) in both An. gambiae and 
An. stephensi (fig. SI). The capacity of mosqui- 
toes carrying D. tsuruhatensis TC1 to transmit 
P. berghei from infected to naive mice (13) was 
also assessed (Fig. 1C). Only 33% of mice were 
infected when bitten by P. berghei-infected mos- 
quitoes carrying D. tswruhatensis TC1 compared 
with 100% when bitten by control mosquitoes 
not colonized by D. tswrwhatensis TC1 (Fig. 1D), 
in accordance with the reduction of sporozoite 
prevalence and sporozoite load in salivary 
glands (Fig. 1E). Once ingested, the bacterium 
inhibits P. falciparum development for at least 
16 days after colonization, and possibly for the 
life of the mosquito (Fig. 1F). 

An ultrastructure study of D. tswruhatensis 
TCI maintained in culture revealed a wide di- 
versity of bacterial shapes, from elongated rod- 
shaped to roundish or oval cells (Fig. 1G, left 
panel). The bacterial cells varied in size from 
0.7 to 4.2 um in diameter, and the length of 
some cells reached 24 um (Fig. 1G, middle and 
right panels), suggesting a large space occu- 
pancy in mosquito cavities. This morpholog- 
ical heterogeneity is atypical compared with 
other D. tsuruhatensis strains, such as HR4, 
for example, which exhibits regular rod-shape 
cells with flagella (74). 


Fitness cost of D. tsuruhatensis TC1 to 
mosquitoes and lack of transmission 


We measured survival of An. gambiae with or 
without D. tsuruhatensis TC1 and found that 
D. tsuruhatensis TC1 did not affect mosquito 
longevity (fig. S2, A and B). We measured fe- 
cundity, by comparing the number of eggs laid, 
and fertility, by comparing the percentage of 
hatched eggs, of An. Gambiae with and without 
D. tsuruhatensis TC1l. We found no difference 
in fecundity or fertility in any of the test groups 
(fig. S2, C and D). The presence of bacteria did 
not significantly affect blood feeding rate nor 
the size of the blood meal (fig. S2, E and F). 
Bacteria were not released into the feeder when 
mosquitoes carrying D. tsuruhatensis TC1 bacte- 
ria took a blood meal (fig. S3), indicating that 
D. tsuruhatensis TC1 cannot be transmitted to 
humans by a mosquito bite. In summary, we 
found that, in the laboratory, D. tswrwhatensis 
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TC1 bacteria do not impair mosquito survival, 
fecundity, fertility, or blood feeding and that 
the bacteria are unlikely to be released into a 
mammalian host by mosquito bite. 


Transstadial persistence of bacteria from 
larvae to adult mosquitoes 


To determine possible effects on mosquito lar- 
vae, different concentrations of D. tswruhatensis 


Fig. 1. D. tsuruhatensis TC1 inhibits Plasmodium 
development in, and transmission by, An. 
gambiae mosquitoes. P. falciparum (A) and 

P. berghei (B) oocyst formation in mosquitoes 
carrying D. tsuruhatensis TC1 bacteria is strongly 
inhibited compared with control mosquitoes 

or mosquitoes carrying Pantoea agglomerans. 
Data pooled from three (P. falciparum) and two 
(P. berghei) independent experiments. (C) Exper- 
imental design. Two-day-old mosquitoes were fed 
on 5% sugar with or without 10’ bacteria/ml of 
Delftia or Pantoea overnight. Two days later, 
mosquitoes were fed on the same P. berghei- 
infected mouse, assuring that all mosquitoes 
ingested infected blood with the same parasitemia. 
After 21 to 23 days, when sporozoites had reached 
the salivary glands, mosquitoes were randomly 
selected and allowed to bite naive mice (three 
mosquitoes per mouse). (D) The parasitemia of 
each mouse was followed daily for 14 days. Bars 
represent standard error of the mean. (E) The 
number of sporozoites per salivary gland. Data 
pooled from three independent experiments, 
each using five mice per challenged group for a 
total of 15 mice. Statistical analysis was 
determined by the log-rank (Mantel-Cox) test 

(D) or by the Mann-Whitney U test (E). For (D), 
control and Delftia: P < 0.0001; control and 
Pantoea: P = 0.6702. For (E), control and Delftia: 
P < 0.0001; control and Pantoea: P = 0.2825. 
NS, not significant. (F) Inhibition by D. tsuruhatensis 
TC1 bacteria is long-lasting. D. tsuruhatensis TC1 
or Pantoea bacteria were administered (or not, as 
a control) to An. gambiae mosquitoes, and 16 days 
later all mosquitoes were infected with the same 
P. falciparum gametocyte culture. The oocyst 
number per midgut was determined 7 days later. 
Data pooled from three independent experiments. 
Number: number of mosquitoes analyzed; 
Prevalence: proportion of mosquitoes carrying one 
or more oocysts. Statistical analysis by Mann- 
Whitney U test. ***P < 0.001. (G) Transmission 
electron micrographs of an in vitro culture of 

D. tsuruhatensis illustrating various bacterial 
shapes (left and middle panels). The right panel 
shows a scanning electron micrograph of a long 
form of the bacterium. Scale bars, 2 um. 


TC1 (10%, 10°, and 10° bacteria/ml) were added to 
the rearing water of 2-day-old An. gambiae 
larvae, followed by daily monitoring of sur- 
vival. No or little effect was detected at 10* 
and 10° bacteria/ml, whereas substantial mor- 
tality was observed at 10° bacteria/ml (fig. $4). 
The reason for this effect is unclear. A flu- 
orescent protein gene was inserted into the 
D. tsuruhatensis TC1 genome to allow tracking 


P. falciparum 


Oocysts/midgut 


of the bacteria within mosquitoes. When added 
to the larval rearing water, D. tsuruhatensis TC1 
efficiently persisted during transstadial de- 
velopment to adult mosquitoes. All emerged 
adults had D. tsuruhatensis TC1 in their mid- 
guts (table S2). When fed to adult mosquitoes, 
the bacteria stably populated the gut but were 
not found in ovaries (table $3). By examin- 
ing the larval progeny of female mosquitoes 
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fed with fluorescent bacteria, we found that 
D. tsuruhatensis TC1 is not transmitted ver- 
tically (table $4). Mating of male mosquitoes 
carrying fluorescent D. tsuruhatensis with 
virgin females revealed that bacteria are not 
transmitted horizontally (sexually) from males 
to females (table S5). 


Bacterial inhibitor(s) of Plasmodium development 


D. tsuruhatensis TC cell-free culture superna- 
tant in a minimum M9 salt medium was added 
to P. falciparum-infected blood and fed to 
mosquitoes (Fig. 2A). D. tsuruhatensis TC1 
supernatant inhibited oocyst formation (97% 
of the mean) (Fig. 2B). When supernatant was 
passed through a 3 kDa cutoff filter, a similar 
inhibition was observed in the filtrate, imply- 
ing that the size of the active molecule is <3 kDa 
(Fig. 2B). Supernatant heated to 70°C was also 
inhibitory, indicating that the active component 
or components are stable at high temperatures 
(Fig. 2B). Dead bacteria showed some ability 
to block transmission (Fig. 2B), perhaps be- 
cause the inhibitor is retained within dead cells. 
Supernatant treatment with proteinase K did 
not alter Plasmodium inhibition, suggesting 
that the inhibitor is not proteinaceous (fig. S5). 

Dose-dependent inhibition of Plasmodium 
infection was observed by D. tsuruhatensis 
TC cell-free supernatant, such that 50 wl of a 
100-fold diluted supernatant mixed with 300 ul 


Pellet washed with M9 
medium, and diluted to 
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of P. falciparum gametocyte culture inhibited 
oocyst formation by ~60% (fig. S6). The effec- 
tiveness of supernatant to inhibit parasite de- 
velopment decreased during the first two days 
after administration, and inhibition was lost 
by day four (fig. S7). 

Formation of Plasmodium oocysts on the 
basal side of the mosquito midgut epithelial 
cells is preceded by a series of developmental 
steps within the blood bolus. Gametogenesis, 
fertilization, and zygote differentiation into mo- 
tile ookinetes all occur within the blood meal 
after mosquito intake (15). To understand which 
stage of parasite development is affected by 
the D. tswruhatensis TCl-secreted compound(s), 
mosquitoes that had been fed with superna- 
tant mixed with infectious gametocytes were 
dissected at 22 hours after feeding, and the 
number of ookinetes within the blood bolus 
was measured. Ookinete formation was strong- 
ly suppressed by the D. tsuruhatensis TC1 super- 
natant, indicating that inhibition occurs at an 
early stage (Fig. 2C). We found that the super- 
natant did not significantly inhibit male game- 
togenesis (a process known as exflagellation) 
(Fig. 2D). In summary, D. tsurwhatensis TC1 
secretes a nonproteinaceous molecule (or mol- 
ecules) of <3 kDa that inhibits early stages of 
P. falciparum development in the mosquito 
midgut in a dose-dependent manner but does 
not affect male gamete formation. 
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Identification and characterization of harmane 
The D. tsuwruhatensis TC1 supernatant was frac- 
tionated by medium-pressure reversed-phase 
liquid chromatography, and the bioactivity 
of the obtained fractions was evaluated (figs. 
S8 and S89). Analysis of the two most potent 
fractions by liquid chromatography-ultraviolet 
spectroscopy-high-resolution mass spectrom- 
etry (fig. S10) and nuclear magnetic resonance 
spectroscopy (figs. S11 to S13) identified 1-methyl- 
B-carboline (harmane) as a common major 
component in the fractions (figs. S10 to S13), 
mixed with some compounds of lipidic nature. 
A dose-response curve for inhibition of oocyst 
formation was obtained when mosquitoes were 
fed on different concentrations of harmane 
mixed with P. falciparum-infected blood (Fig. 3A). 
A 90% mean reduction in the number of oocysts 
was observed at 150 nM harmane. The half- 
maximal inhibitory concentration (ICs) was 
46 nM (Fig. 3B). Further, harmane reduced 
salivary sporozoite numbers by 80%, whereas 
the D. tsuruhatensis TC1 supernatant reduced 
sporozoite numbers by 88% (fig. S14). 
Considering the small size (<3 kDa) of the 
active molecule in the D. tsuruhatensis TC1 
supernatant, we investigated the possibility 
that it crosses the mosquito cuticle. Mosquitoes 
were exposed to culture supernatant dried 
onto a glass surface and subsequently fed a 
P. falciparum-infected blood meal. Exposed 
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Fig. 2. D. tsuruhatensis TC1 produces and secretes a small Plasmodium- 
inhibitory compound. (A) Schematic outline for generation of samples tested. 
Bacteria were grown in LB medium, washed with M9 minimal medium, and 
incubated in this medium for 8 hours at 28°C. Heating bacterial suspension for 
15 min at 70°C generated dead bacteria (Dead-B). Filtering the 8-hour culture 
through a 0.22 wm filter produced the supernatant, which was tested as is (D-8h 
and P-8h), after boiling (D-boiled), or after passage through a filter that excludes 
molecules larger than 3 kDa (D-8h<3 kDa). These samples were added to 
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P. falciparum gametocyte cultures and tested for parasite development. (B) Inhibition 
of oocyst formation. (C) Inhibition of ookinete formation. (D) P. falciparum gametocyte 
exflagellation in the presence of M9 medium or bacteria culture supernatant. 

Bars represent standard deviation of the mean. Number: number of mosquitoes 
analyzed; Prevalence: proportion of mosquitoes carrying one or more parasites. 
For (B) to (D), data pooled from three independent experiments. Statistical 
analysis of data in (B) and (C) by Mann-Whitney U test. ***P < 0.001; **P < 0.01; 
*P < 0.05. Statistical analysis of data in (D) by t test (differences are not significant). 
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A 50: Harmane feeding assay 


Oocysts/midgut 


M9 1nM 10nM 25nM 50nM 150nM D-8h 
Number 111 120 122 129 144 124 118 
Prevalence 91.9% 87.5% 69.7% 59.7% 48.6% 21.0% 24.6% 


Mean 6.7 7.4 5.2 3.7 1.3 0.7 0.8 
Inhibition “11.2% 21.8% 45.0% 80.5% 90.2% 88.7% 
Median 5 4 2.5 1 0 0 0 
Inhibition 20% 50% 80% 100% 100% 100% 
P-value NS = aS ae = om 
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Fig. 3. Harmane inhibits Plasmodium development in An. gambiae mosquitoes. 
(A and B) Harmane feeding experiments. (A) P. falciparum gametocytes were fed to 
mosquitoes with or without the indicated concentrations of harmane. M9: M9 
medium control; D-8h: supernatant of D. tsuruhatensis TC1 cultured in M9 medium 
for 8 hours. The horizontal red bars indicate median values. Number: number 

of mosquitoes analyzed; Prevalence: proportion of mosquitoes carrying one or more 
oocysts. Data pooled from three independent experiments. (B) Dose-response 
curve fit for oral feeding of harmane [nonlinear regression, n = 13, df = 12, sum of 
squares = 1054, coefficient of determination (R*) = 0.9082]. The ICs for harmane 
feeding was calculated by interpolation and is indicated below the graph. Mean 
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HA amount/m? (umol/m?) ATQ amount/m? (umol/m?) 
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inhibition relative to control oocyst intensity is indicated. Error bars are 95% 
confidence intervals (Cl). Fifty percent inhibition observed at 45.9 nM. Relates to 
data in (A). Statistical analysis by Mann-Whitney U test. ***P < 0.001; **P < 0.001; 
*P < 0.05. (€ to E) Contact inhibition experiments. (C) The D. tsuruhatensis TC1 
supernatant contains a compound that penetrates the mosquito cuticle and inhibits 
P. falciparum development. Female mosquitoes were exposed for 60 min to D-8h, 
atovaquone (ATQ), or P-8h (Pantoea supernatant) dried on a glass plate before 
infection with P. falciparum gametocytes. D-8h feeding: supernatant mixed with 
infectious blood and fed to mosquitoes, as a positive control. Data pooled from two 
independent experiments. (D) Harmane (HA) or (E) atovaquone (ATQ) were dried 
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on glass plates at the final concentrations indicated. Mosquitoes were exposed to these 
plates for 60 min and then fed P. falciparum gametocytes. M9: 3 ml of M9 medium 
dried on the plate. D-8h: 3 ml D. tsuruhatensis TC1 supernatant dried on the plate. 
(D) and (E) share the same M9 and D-8h data. Data pooled from two independent 
experiments. Statistical analysis by Mann-Whitney U test. ***P < 0.001; **P < 0.01. 
Number: number of mosquitoes analyzed; Prevalence: proportion of mosquitoes 
carrying one or more oocysts. (F) Dose-response curve fit for harmane (HA) exposure 
(nonlinear regression, n = 17, df = 16, sum of squares = 6698, R = 0.6122). The ICsq for 


mosquitoes suppressed P. falciparum oocyst 
formation with an effectiveness similar to that 
of oral uptake (Fig. 3C). This inhibition matched 
that of the antiprotozoal agent atovaquone, 
when taken up through the mosquito cuticle 
(16). Further experiments showed that the 
identified small (<3 kDa) hydrophobic har- 
mane is also a transcuticle inhibitor (Fig. 3D), 
with an ICs9 of 0.10 umol/m? (Fig. 3D), com- 
pared with 3.6 umol/m” for atovaquone (Fig. 3, 
E and F, and fig. S15). 

Harmane was administered to female mos- 
quitoes either by feeding or by contact at 0, 1, 
2, and 4 days before feeding P. falciparum- 
infected blood. Oocyst numbers per midgut 
were determined 7 days later (fig. S16A). Trans- 
mission reduction activity was pronounced 
when harmane was fed to mosquitoes just be- 
fore ingestion of infected blood, but inhibition 
was lost when mosquitoes were fed infected 
blood one or more days later (fig. SI6B). In the 
contact assay, significant transmission reduc- 
tion activity was observed at days 0 and 1 after 
mosquitoes were fed infected blood, but again, 
inhibition was lost at longer intervals after 
blood feeding (fig. S16C). 


Harmane specificity 


Asexual growth of P. falciparum 3D7A and Dd2 
strains was minimally affected by harmane, 
unlike other known asexual growth inhib- 
itors (table S6). We also found no detectable 
inhibition of male or female gametogenesis 
by harmane, in contrast to known inhibitors 
(table S7). To determine at which step har- 
mane acts during parasite sexual develop- 
ment, it was (or was not) added to P. berghei 
gamete differentiating into ookinetes in vitro. 
Harmane strongly inhibited ookinete forma- 
tion when added at time zero and also when 
added at 4 hours, when zygotes had formed, 
indicating that harmane also affects stages of 
ookinete formation after fertilization (fig. S17). 


Mathematical modeling to predict the impact 
of D. tsuruhatensis TC1 in the field 


To explore the utility of D. tsuwruhatensis TC1 as 
a potential tool for malaria control, we intro- 
duced it as an intervention within an established 
transmission dynamic mathematical model of 
malaria (17, 18). In the modeled scenario, mos- 
quitoes were colonized with D. tsuruhatensis 
TCI after feeding on attractive sugar baits, or 
ASBs (Fig. 4A). For the purposes of the model, 
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we assumed that baits are distributed during 
the peak of the malaria transmission season 
in sufficient numbers to achieve a defined bait 
feeding rate, which we varied on the basis 
of outcomes observed in entomological field 
trials (19, 20). A bait feeding rate of 0.34 per 
day was predicted to result in more than half 
of all adult female mosquitoes being infected 
at any given time (Fig. 4A). We introduced 
the baits into a high-transmission setting (en- 
tomological inoculation rate of 195 infectious 
bites per person per year), in which malaria is 
highly seasonal (27). In the simulations, baits 
were used alongside pyrethroid-treated bed- 
nets, which are a long-established interven- 
tion against malaria throughout sub-Saharan 
Africa, assuming a bednet usage of 70% fol- 
lowing a mass campaign (J). We quantified 
the impact of the baits in terms of the clinical 
cases averted within a 3-year period (Fig. 4B), 
highlighting the influence of the bait feeding 
rate (per mosquito) on the impact of the inter- 
vention. With a feeding rate of 0.34 baits per 
day, the model predicted that D. tsurwhatensis 
TCI could prevent 250 cases per 1000 people 
over a 3-year period (or a 15% reduction in cases), 
if maintained for 6 months of each year (Fig. 
4B). These results suggest that D. tswruhatensis 
TC1 could be a useful addition to the toolbox of 
interventions against malaria and would be es- 
pecially valuable for targeting residual trans- 
mission caused by outdoor-feeding mosquitoes. 


Evaluation for translation to field settings 


That a nonrecombinant bacterium can populate 
anopheline mosquitoes and inhibit Plasmodium 
development in the laboratory indicates po- 
tential for malaria control in the field. We 
conducted experiments in a MosquitoSphere 
facility in Burkina Faso that simulates the field 
environment (22) (Fig. 5A). The feasibility of 
populating a local colony of Anopheles coluzzit 
mosquitoes with D. tswruhatensis TC1 using 
this setup was tested in two ways, either by using 
cotton balls soaked with a D. tsuruhatensis 
TCI suspension in sugar and placed in clay pot 
refuges or by rearing larvae to adults in a breed- 
ing site seeded with this bacterium. Overnight 
feeding using cotton balls resulted in 75% 
colonization of mosquitoes by the bacterium 
(Fig. 5B). When larvae were reared in water con- 
taining D. tswruhatensis TCI, all the emerging 
adults carried the bacterium (Fig. 5B). Direct 
membrane feeding assays with these colo- 
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HA exposure was calculated by interpolation and is indicated below the graph. Mean 
inhibition relative to control oocyst intensity is indicated. Error bars are 95% Cl. 
Relates to (B) and fig. S14. Fifty percent HA inhibition observed at 102.2 nmol/m*. 
(G) Dose-response curve fit for atovaquone (ATQ) exposure (nonlinear regression, 
n = 13, df = 12, sum of squares = 4120, R= 0.8085). The ICso for ATQ exposure 
was calculated by interpolation and is indicated below the graph. Mean inhibition 
relative to control oocyst intensity is indicated. Error bars are 95% Cl. Fifty percent 
ATQ inhibition observed at 3634 nmol/m®. Relates to (C) and fig. S14. 


nized mosquitoes showed that D. tswruhatensis 
TC1 inhibited (by 74%) development of P. 
falciparum obtained from local patients (Fig. 
5, C and D). We also tested the ability of D. 
tsuruhatensis TC1 to colonize wild mosqui- 
toes captured in houses of the Vallée du Kou 
(VK5) village, Burkina Faso, by feeding them 
on cotton balls soaked with bacterial sus- 
pension. We found that mosquitoes carrying a 
field-acquired microbiota were as (or more) ef- 
ficiently colonized by D. tswruhatensis TC1 as 
lab-reared mosquitoes (fig. S18). 

Thus, in a contained field setting, D. 
tsuruhatensis TC1 was efficiently introduced 
into mosquitoes either orally from sugar baits 
containing bacteria or by seeding mosquito 
breeding sites with bacteria. Mosquitoes carry- 
ing this bacterium successfully suppressed de- 
velopment of locally circulating P. falciparum 
parasites. 


Discussion 


Currently employed standards of care and con- 
trol for malaria are either directed at target- 
ing the parasite in the human host (treatment 
or prophylaxis) or at killing the malaria mos- 
quito vector with insecticide-treated bednets and 
indoor residual spray. Unlike these approaches, 
the D. tsuruhatensis TC1-based intervention 
targets parasite stages in the mosquito and 
incapacitates its ability to transmit the malaria 
parasite. Delivery to either larvae or adult mos- 
quitoes was efficient in both laboratory and in 
contained field settings, indicating versatility 
for final deployment. Mosquitoes in the afore- 
mentioned MosquitoSphere are exposed to a 
close-to-natural environment, with varying 
ambient climatic conditions and access to a 
range of resting sites and dietary resources, 
including plant nectar. High colonization rates 
of mosquitoes within such natural settings, in- 
cluding in wild-caught mosquitoes, combined 
with the effective blocking of local patient- 
derived P. falciparum parasites, are key con- 
tributing features for potential implementation 
of the D. tsurwhatensis TC1-based intervention 
in the field. Although <20 bacteria populate 
the mosquito gut, once introduced, bacteria 
numbers and Plasmodium inhibition ability 
remain stable for 16 days, and possibly for 
the life of the mosquito (23). Inhibition was 
effective in An. stephensi and An. gambiae, the 
main malaria vectors from the Indo-Iranian 
region and Africa, respectively. Both human 
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Fig. 4. Mathematical modeling to predict the potential impact of Delftia as a 
public health intervention against malaria. |n this scenario, Delftia is introduced 
into a wild population of Anopheles mosquitoes by means of sugar baits. The 
epidemiological impact is estimated using a transmission dynamic mathematical 
of malaria. The setting is based on the village of Longo, in the region of 
Koulpélogo in Burkina Faso. (A) Introduction of Delftia into the wild mosquito 
population with a bait feeding rate of 0.34 per day. We assume that baits are 
maintained for part of the year, denoted by the dashed vertical lines and light blue 
Once the sugar baits are introduced, a fraction of the mosquito population 
becomes infected (maroon curve), the size of which depends on the bait feeding 
alaria transmission is maintained by older mosquitoes who have already 
blood-fed (blue curve) and then survived long enough to become sporozoite-positive 


model 


arrow. 


ate. 


(P. falciparum) and rodent (P. berghei) parasites 
are inhibited, suggesting that D. tsurwhatensis 
TC1 may inhibit all human parasites, includ- 
ing P. vivax, P. ovale, and P. malariae. Not 
only was Plasmodium infection of mosquitoes 
inhibited, but parasite transmission was also 
significantly reduced using a murine model. 
Mathematical modeling shows that the intro- 
duction of D. tsuruhatensis TC1 into a mos- 
quito population could avert an additional 
~15% of the total clinical malaria cases, further 
building the case for field implementation. 

D. tsuruhatensis TC fed to larvae are trans- 
ferred to adults, but transovarian transmission 
to the subsequent generation or horizontal 
(sexual) transmission from male to virgin fe- 
male mosquitoes was not detected. This could 
be linked to the low numbers of bacteria that 
populate the gut and implies the necessity of 
steady introduction of D. tswruhatensis TC] into 
mosquito populations. Nevertheless, absence 
of forward transmission may be advantageous 
from a safety and regulatory perspective, as 
strain containment within a single mosquito 
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estimated in an entomologi 


generation would help abate environmental 
spread and exposure to nontarget organisms. 

D. tsuruhatensis is a biosafety level 1, motile 
gram-negative bacterium from the Comamona- 
daceae family. It is widely distributed in dif- 
ferent ecosystems (fresh and marine water, 
soil, rhizosphere, plants) (24, 25), is found in 
different insect species (26, 27), and is part of 
the normal microbiota of anophelines (28-30). 
D. tsuruhatensis has been reported to be an 
opportunistic health care-associated patho- 
gen in immunocompromised individuals with 
severe underlying conditions, but reported cases 
were mostly from the same hospital setting 
(31). Importantly, D. tswruhatensis TCl-infected 
mosquitoes do not release the bacterium with 
their saliva when feeding, suggesting that it is 
not transmitted to humans by mosquito bite. 
Unlike insecticides, D. tswruwhatensis TC1 does 
not impose fitness load or reproductive costs 
on the mosquito, thus avoiding selection of 
resistant mosquitoes. Since it does not target 
malaria in the human host, where parasite 
numbers may reach 10””, the development of 
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(red curve). In this scenario, malaria transmission is seasonal, with the gray line 
showing the adult mosquito population (rescaled to fit on 


the figure), which peaks in 
ic health impact of the intervention, reported in terms of 
a 3-year period. We model the introduction of Delftia 


into a high-transmission, seasonal setting, in which the use of insecticide-treated 


(see methods). Here we show how the predicted public 


health impact varies according to the bait feeding rate (per mosquito per day) and 
the duration over which the ASBs are maintained each year—2 months (orange 
curve), 4 months (light-blue curve), or 6 months (green curve). The secondary y axis 
to the right translates the public health impact of Delftia into the percentage of cases 
averted over the 3-year period. The vertical dashed line shows the feeding rate 


cal field trial and corresponds to the rate used in (A). 


parasite resistance in the mosquito, which 
typically carries five oocysts or less, is mini- 
mized. D. tsuruhatensis TC1 is not genetically 
modified—unlike strains of Pantoea, SerratiaAS1, 
and Asaia, which have been engineered to 
express anti-Plasmodium effectors (32-34)— 
mitigating regulatory and ethical concerns for 
its release (35). Interestingly, a Delftia-based 
product (36) is currently marketed for addition 
to soil to promote soybean growth, supporting its 
potential for safe release into the environment. 
Bioassay-guided fractionation of the D. 
tsuruhatensis TC1 culture supernatant led to 
the discovery and identification of the small 
hydrophobic compound harmane, present at 
nanogram levels in active fractions. Harmane, 
a carboline derivative, was first isolated from 
Arariba rubra (or Simira rubra) (37). Itis present 
in plant extracts used in traditional medicine in 
Mali and Sao Tome (38), occurs in mammalian 
tissues and body fluids, and is also found in a 
variety of foods, including fried meat and fish, 
alcoholic drinks, and coffee brews (39). Neu- 
rotoxic effects, including psychoactive and 
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Fig. 5. Contained field experi- A 
ments. (A) The MosquitoSphere 

facility is located in Soumousso, 
Burkina Faso (upper left panel), 

and consists of six individual 
compartments (upper right panel) 
containing plants and clay pots 

(lower left panel) for mosquito 

efuge sites and larval breeding 

sites (lower right panel). (B) Effi- 
ciency of D. tsuruhatensis TC1 

bacteria introduction into adult 
mosquitoes either by seeding the 

arval breeding water with 

D. tsuruhatensis TC1 (10’ bacteria/ml) c 
or by placing into clay pots cotton 80 
balls soaked with 10’D. tsuruhatensis 
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after an overnight exposure. The 3 3 
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of the 200 mosquitoes released 
were recovered and analyzed. 

(C) An. coluzzii mosquitoes carrying 
D. tsuruhatensis TC1 bacteria (or 
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infected blood obtained from local patients to which D. tsuruhatensis TC1 supernatant or M9 medium (control) was added, and midgut oocysts were counted after 7 days. 


hallucinogenic effects, have been reported to 
occur at doses well above dietary exposure levels, 
which have been estimated at a maximum of 
1ug harmane per kilogram of body weight per 
day (40). B-carboline derivatives have been known 
to have insecticidal, fungicidal, and plant growth 
regulatory properties and have been investigated 
for activity against several parasites (4-44). Our 
studies reveal that harmane does not inhibit 
asexual stages of malaria but specifically targets 
formation of ookinetes in the mosquito at nano- 
molar concentrations, significantly reducing spo- 
rozoite prevalence in salivary glands. Harmane 
has a short half-life in mosquitoes (1 to 2 days). 
Modifications of its chemical structure or devel- 
opment of slow-release formulations could help 
extend its efficaciousness beyond 2 days. Impor- 
tantly, harmane can be administered to mos- 
quitoes by contact, and its ability to traverse the 
mosquito cuticle provides additional modalities 
for delivery via mosquito resting surfaces. 
Following the observation that insectary mos- 
quitoes became refractory to P. falciparum in- 
fection, we identified a strain of D. tswruhatensis 
that efficiently colonizes An. gambiae and 
An. stephensi and inhibits early development 
stages of the malaria parasites P. falciparum 
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and P. berghei by secreting harmane, a small 
hydrophobic compound. Potential translatabil- 
ity to the field using simple low-cost technolo- 
gies is supported by mathematical modeling 
and successful contained field studies performed 
in Burkina Faso. Studies are currently underway 
for further development of a D. tswruhatensis 
TCi-based malaria control tool for potential 
use in conjunction with existing strategies to 
achieve global malaria eradication. 
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FERROELECTRICS recovery dilemma of excellent elasticity Check for 
high crystallinity in the elastification of haat 


Intrinsically elastic polymer ferroelectric by precise 


slight cross-linking 


Liang Gao’, Ben-Lin Hu’, Linping Wang’, Jinwei Cao’, Ri He’, Fengyuan Zhang’, Zhiming Wang’, 


Wuhong Xue®, Huali Yang’, Run-Wei Li?* 


Ferroelectrics are an integral component of the modern world and are of importance in electrics, 
electronics, and biomedicine. However, their usage in emerging wearable electronics is limited by 
inelastic deformation. We developed intrinsically elastic ferroelectrics by combining ferroelectric 
response and elastic resilience into one material by slight cross-linking of plastic ferroelectric 
polymers. The precise slight cross-linking can realize the complex balance between crystallinity and 
resilience. Thus, we obtained an elastic ferroelectric with a stable ferroelectric response under 
mechanical deformation up to 70% strain. This elastic ferroelectric exerts potentials in applications 
related to wearable electronics, such as elastic ferroelectric sensors, information storage, and 


energy transduction. 


earable electronics with stretchability 

should be sufficiently elastic to con- 

form to biological tissues and accom- 

modate body movements upon large 

(up to 50 to 80%) and frequent strains 
CZ, 2). These requirements have gradually be- 
come fundamental features of materials for 
skin-like elastic electronics (3, 4). Recently, 
prototypes of wearable sensors and circuits 
based on intrinsically elastic conductors or 
semiconductors have been proposed (2, 5, 6). 
However, the elastic ferroelectric resulting 
from the elastification of ferroelectrics (FEs), 
which are crucial and promising basic mate- 
rials for modern electronics (7-11), is still 
lagging behind that of their intrinsically elastic 
counterparts, impeding their application in 
emerging wearable devices. 

FEs feature reversible spontaneous polari- 
zation and distinctive properties (FE response) 
that make them suitable for a wide range of 
applications, ranging from electronic to electric 
power and biomedical applications (12-16). 
However, the dominant polymer FEs that are 
based on poly(vinylidene difluoride) (PVDF) 
failed to recover after stress relieving though 
enabling large plastic deformation (17, 18), let 
alone the traditional oxide FEs that even expe- 
rience brittle failures at the very beginning of 
stretching (19, 20). Consequently, developing a 
new FE for wearable electronics that overcomes 
this FE response-elastic recovery dilemma is 
critical but challenging. 
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Application Technology, Ningbo Institute of Materials 
Technology and Engineering, Chinese Academy of Sciences, 
Ningbo 315201, China. *Nano Science and Technology 
Institute, University of Science and Technology of China, 
Suzhou 215123, China. “Key Laboratory of Magnetic 
Molecules and Magnetic Information Materials of Ministry of 
Education, School of Chemistry and Materials Science, 
Shanxi Normal University, Taiyuan 030032, China. 
*Corresponding author. Email: hubenlin@nimte.ac.cn (B.-L.H.); 
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Typically, the elastification of electronic mate- 
rials can be realized by structural engineering of 
rigid materials with mechanical and geometrical 
designs (4), blending of functional materials into 
elastomers (2, 3), or intrinsic elastification of 
functional materials (5, 6). Samples fabricated 
by structural engineering only enable a limited 
stretching range (less than the prestrained value) 
(2D); they often require complicated fabrication 
techniques and show reduced device densities 
within an array (4, 21). The composites obtained 
by blending oxide FE particles into elastomers 
encounter extremely challenging poling prob- 
lems (22-25). Indeed, in these composites, the 
FE domains of each oxide particle are orientated 
randomly, which requires effective poling 
(22, 23). However, as the electric field mainly 
partitions in the low-conductive elastomer 
phase, a very high poling field for the oxide 
FE particles is needed at the high risk of die- 
lectric and electromechanical breakdown of 
the composites (24, 25). Therefore, intrinsic 
elastification may be the only possible route 
to impart elastic resilience to FEs effectively. 
This process could enable the development of 
materials characterized concurrently with scala- 
ble production by solution, high device densities, 
and excellent strain endurance (5, 6). 

In the past few years, chemical cross-linking 
has witnessed substantial development in the 
intrinsic elastification of semiconductors and 
conductors (5, 6). Nevertheless, from the per- 
spective of the elastification of polymer FEs, the 
FE response-elastic recovery dilemma may still 
occur through conventional chemical cross- 
linking. In more detail, the ferroelectricity of 
PVDF-based polymer FEs originates from the 
orientation and polarization of FE domains 
in the crystalline region (8, 20, 26), which im- 
plies that an excellent FE response requires a 
high crystallinity (27-29). On the contrary, a 
medium or low crystallinity would be essential 
for outstanding elastic recovery of elastomers. 
Therefore, addressing the FE response-elastic 


challenging (30, 37). By using poly(vinylidene 
fluoride-trifluoroethylene) [P(VDF-TrFE)] with 
a soft- and long-chain cross-linker, we suc- 
cessfully developed an intrinsic elastifica- 
tion strategy for establishing a stable network 
in elastic polymer FEs by precise slight cross- 
linking. 


Design and synthesis of elastic polymer FEs 


PVDF-based FEs are semi-crystalline polymers 
and ductile under strain with characteristic 
yielding (8). The relative slippage among mo- 
lecular chains induced the necking deforma- 
tion of polymers under strain (Fig. 1A, top panel). 
This behavior is an unrecoverable deformation 
as the stress is removed, with the stress-strain 
curve of a semi-crystalline polymer with a char- 
acteristic yielding point (8, 37). Thus, PVDF-based 
FEs exhibit good plasticity but poor elasticity 
(32, 33). To combine ferroelectricity and elas- 
ticity in one material, we used chemical cross- 
linking to convert a plastic deformation into an 
elastic one. To avoid the high Young’s modulus 
resulting from general cross-linking, we devel- 
oped a cross-linking strategy with low cross- 
linking densities (1 to 2% in this work), named 
the “slight cross-linking” methodology; a long- 
and soft-chain in the cross-linker acts as a plas- 
ticizer for a low modulus. Low cross-linking 
density (see table S1 for more details about the 
definition and fig. S1 for the effect of cross- 
linking density on the mechanical recovery) 
can impart elastic resilience (recoverable up to 
125% strain) to the linear polymer FEs while 
maintaining high crystallinity for a good FE 
response. As a result, an elastic FE network ob- 
tained by slight cross-linking of linear-polymer 
PVDF-based FEs with long and soft chains was 
established for the simultaneous realization 
of good crystallinity and elastification (Fig. 1A, 
bottom panel). The elastic FEs that we developed 
have a FE response and elastic resilience, with 
an acceptable crystallinity decrease. Overall, 
this “slight cross-linking” methodology should 
be an effective solution to the FE response- 
elastic recovery dilemma. 

Experimentally (Fig. 1B), P(VDF-TrFE) is 
liable to form the f crystalline phase by a solu- 
tion process and, therefore, we used it as the 
linear FE polymer to obtain the slight cross- 
linking (8, 16, 28, 33). Compared with other 
compositions, we selected P(VDF-TrFE) 55/45 
mol % for its low modulus and large elonga- 
tion (34) (fig. S2). This enables elastification 
for wearable electronics while preserving the 
good crystallinity of P(VDF-TrFE). The block 
of polyethylene glycol (PEG) was selected as 
the soft chain because its immiscibility with 
PVDF enables cross-linking to occur mainly in 
the amorphous region, as a result of which the 
high B-phase content in P(VDF-TrFE) crystal- 
line regions was preserved for excellent FE 
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properties (35). Then, the vulcanization of fluo- 
rorubbers was realized by cross-linking P(VDF- 
TrFE) and PEG diamine through imine bonds to 
prepare the elastic FE (36). Compared with 
other cross-linking methods, the use of dia- 
mines as cross-linkers has a high reactivity 
(37), which is ideally suited for the “slight cross- 
linking” methodology with the reactive group 
at a low concentration and can be completed 
only by heating (36, 38). Moreover, its mecha- 
nism is clear with a simple addition reaction 
to the polymer backbone (8, 39). By designing 
this method to develop an elastic polymer FE, 
we expected that the high B-phase content and 
the stable network would ensure both a FE 
response and elastic recovery. 


Cross-linking and characterization of elastic 
polymer FEs 


We dissolved P(VDF-TrFE) and PEG diamine 
in cyclohexanone. We then cast the solution 
into thick films or spin-coated it into thin films. 
After removing the solvent under vacuum, we 
obtained cross-linked P(VDF-TrFE) films through 
thermal cross-linking. As shown by the differen- 
tial scanning calorimetry (DSC) curve of the 
P(VDF-TrFE) and PEG diamine blended film 
(Fig. 2A), the cross-linking reaction commen- 
ces at 188°C and ends at 254°C. We found the 
highest crystallinity in the sample cross-linked 
at 240°C under vacuum by the DSC peaks with 
different thermal cross-linking temperatures 
(fig. S3); hence, we chose this as the cross- 
linking condition to finalize the transition from 
plastic to elastic FEs. 


Pristine P(VDF-TrFE) 


Stretching 
———— 


Unrecoverable 


Stretchable 
———— 
——— 
Elastic 


We confirmed the formation of the cross- 
linked structure by the broad peak at 1647 cm™ 
(attributed to the imine bond C=N) in the 
Fourier-transform infrared (FT-IR) results 
(Fig. 2B) (36). The chemical valance of F and C 
elements remained almost unchanged because 
of the low cross-linking density, which was 
confirmed by x-ray absorption near-edge struc- 
ture (XANES, fig. S4) results. All the amino 
groups in the cross-linker (PEG diamine) were 
reacted into imine groups, which we confirmed 
by the disappearance of the amino peak spectra 
and the existence of only the imine peak in 
the cross-linked P(VDF-TrFE) of XANES (fig. S5) 
and x-ray photoelectron spectroscopy (XPS, 
fig. S6) spectra. Therefore, we could calculate 
the cross-linking density from the amount of 
PEG diamine used in the preparation stage. 
In addition, the decrease in the intensity of the 
peaks around 1120 and 1174:cm’, which repre- 
sented the TrFE unit marker (40) and the 
CF, antisymmetric stretch, respectively, sug- 
gests that the cross-linking dominantly oc- 
curred on the TrFE units (Fig. 2B, inset). We 
confirmed the good thermal stability of the 
cross-linked P(VDF-TrFE) by the onset deg- 
radation temperature above 300°C (as seen in 
the thermogravimetric analysis curve, fig. S7). 
The cross-linked P(VDF-TrFE) films cannot 
be dissolved in common organic solvents, such 
as dimethylformamide, acetone, isophorone, 
and cyclohexanone; however, they possessed a 
swelling behavior with a gel content of ~60% 
when exposed to these solvents (fig. S8 and 


table S2). The thermal and solvent resistanc- 
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es of the cross-linked P(VDF-TrFE) ensure its 
feasibility for standard electronic industry pro- 
cessing methods. 

As expected, the crystalline phase in cross- 
linked P(VDF-TrFE) mostly retained the S-phase 
morphology, which was confirmed by the 
B-phase marker peaks in the FT-IR spectra at 
840, 1091, and 1284 cm (28, 40). We also con- 
firmed the presence of the B phase by the 19.2° 
peak in the x-ray diffraction (XRD) pattern 
(fig. S9). We verified the presence of the 3/1- 
helical phase using the peaks at 507 cm! and 
18.9° in the FT-IR and XRD spectra, respec- 
tively (28). The polar crystalline phase that 
we observed in the cross-linked P(VDF-TrFE) 
films ensures a large remanent polarization 
under the applied external electrical fields. 
However, compared with the pristine P(VDF- 
TrFE) film, the crystallinity of the cross-linked 
P(VDF-TrFE) films decreased as the cross- 
linking density increased. We confirmed this 
result by calculating the latent heat and melt- 
ing enthalpy from DSC curves (figs. S10 and S11 
and table S3) (28). Consequently, we achieved 
both low modulus and elasticity through de- 
creased crystallinity and increased cross- 
linking density. 

The pristine P(VDF-TrFE) film annealed at 
135°C for 4 hours showed typical semi-crystalline 
polymer mechanical properties with a yield point 
at a strain of 11% and an elongation at break 
above 800% (Fig. 2C). However, after cross- 
linking, the elongation at break of films rapidly 
decreased, and the stress-strain curves gradu- 
ally changed from showing plastic to elastic 
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Fig. 1. Conception and synthesis of elastic FEs. (A) Schematics of the macro and molecular size changes and stress-strain curves of plastic (relative 
slippage among molecular chains leading to necking and unrecovered, top panel) and elastic (chemical cross-linking blocking the relative slippage among 
molecular chains and imparting good elastic resilience, bottom panel) deformations of polymer FEs under strain. (B) Synthesis of an elastic polymer FE by 


slight cross-linking. 
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deformations (Fig. 2C, inset). We used cross- 
linking densities ranging from 0.43 to 2.16% 
to evaluate the mechanical characteristics of 
cross-linked P(VDF-TrFE). Films with cross- 
linking densities above 1.44% exhibited the 
characteristics of elastomer stress-strain curves, 
ie., low modulus without yielding (Fig. 2D). 
In the yielding-free samples, the cross-linked 
P(VDF-TrFE) with a cross-linking density of 
1.44% possessed the highest degree of crystalli- 
nity and a suitable modulus; therefore, we 
selected this cross-linking density for all the 


subsequent experiments. We observed the elas- 
tic recovery, mainly due to the change in entro- 
py rather than energy (fig. S12), in the cyclic 
stress-strain curves of the cross-linked film 
under different strains ranging from 25 to 
125% (10 cycles in Fig. 2E and >3000 cycles at a 
40% strain level in fig. S13). Compared with 
the low resilience of the pristine P(VDF- 
TrFE) films (Fig. 2E, inset), these results con- 
firm that the intrinsic elasticity of the polymer 
FEs was successfully realized by precise slight 
cross-linking of the linear polymer FEs. 
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Fig. 2. Cross-linking characterization and mechanical properties of elastic FE films. (A) DSC 

curves of the P(VDF-TrFE)/PEG diamine blend indicating in situ cross-linking. (B) FT-IR spectra of blended 
and cross-linked P(VDF-TrFE)/PEG diamine (the peak at 1647 cm? from C=N bond confirms the occurrence 
of cross-linking). (©) Stress-strain curves of pristine and cross-linked P(VDF-TrFE) thick films at various 
cross-linking densities represented by different colors. (Inset) Transition from plasticity to elasticity is shown 
by the stress-strain curves below 45% strain presented in (C). (D) Modulus and degree of crystallinity of 
cross-linked P(VDF-TrFE) at various cross-linking densities (n = 4, where n is the number of samples that are 
used to generate statistical data). (E) Cyclic stress-strain curves of cross-linked P(VDF-TrFE) films with a 
cross-linking density of 1.44% and pristine P(VDF-TrFE) films (inset) under different strains (the first cycle 
was omitted owing to clamp sliding), indicating the excellent resilience of the cross-linked P(VDF-TrFE) 
films compared with the unrecoverable deformation of the pristine P(VDF-TrFE) films. 
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Ferroelectricity of cross-linked P(VDF-TrFE) 
We confirmed the ferroelectricity of the cross- 
linked P(VDF-TrFE) films by analysis of the 
temperature dependence of the dielectric con- 
stant (e-T curve), polarization-electric field 
(P-E) loops, and piezoresponse force micros- 
copy (PFM) results. In Fig. 3A, we observed a 
peak at T, = ~65°C in the e-T curve of cross- 
linked P(VDF-TrFE), as a feature of the FE- 
paraelectric phase transition (“Curie transition”), 
indicating that a B-like phase similar to that in 
pristine P(VDF-TrFE) is preserved (28). 

We measured the P-E loops using a capacitor- 
type device with a Pt/cross-linked P(VDF-TrFE) 
(250 nm)/Pt/Si structure (fig. S14). The P-E loop 
expands gradually when the electric field in- 
creases above 120 MV/m and then saturates 
after the electric field exceeds 200 MV/m (Fig. 
3B). The coercive field (£,) of the elastic FE 
film is 60 MV/m, similar to that of the pristine 
P(VDF-TYrFE) (55/45) film (EZ, ~59 MV/m, fig. S15). 
The maximum (Pax) and remanent (P,) polari- 
zation of the elastic ferroelectric film are 9.1 and 
4.70 wC/cm”, respectively, whereas those for the 
pristine P(VDF-TrFE) (55/45) film are 9.5 and 
5.31 C/cm”, respectively. The polarization switch- 
ing of cross-linked P(VDF-TrFE) can be cycled 
more than 10° times (fig. S16). We also mea- 
sured the P-E loops of the cross-linked P(VDF- 
TrFE) at different frequencies (Fig. 3C). As the 
frequency decreases from 10 kHz to 500 Hz, 
P, increases from 3.02 to 5.87 wC/em?, and the 
shape of the P-E loops becomes increasingly 
rectangular, indicating that the switching of 
the FE domain in the cross-linked P(VDF-TrFE) 
requires a longer time and more energy than 
that in the pristine material (fig. S17). The dif- 
ference in the P-E loops of the cross-linked 
P(VDF-TrFE) films is attributed to the decreas- 
ing crystallinity and size of the crystals with 
the increasing cross-linking densities, com- 
pared with those in the pristine P(VDF-TrFE) 
films (fig. S18) (47). Indeed, the crystallinity of 
the cross-linked P(VDF-TrFE) films decreased 
to 38.36 + 1.76%; the FE phases also changed 
from the pure f phase to a mixture of B and 
3/1-helical phases. Although the latter is consid- 
ered a weak polar phase, it can be transformed 
into the former by poling under a high electric 
field at room temperature and, thus, contrib- 
ute to overall polarization (42). 

PFM is a powerful tool that can verify the 
ferroelectricity of a material by exploiting the 
existence of domains with different polariza- 
tion orientations and local hysteretic domain 
switching triggered by electric fields (43-45). 
The ferroelectricity test of the elastic polymer 
FE was conducted by PFM using conductive 
tips on the cross-linked P(VDF-TrFE) film with 
a thickness of 250 nm deposited on a Pt/Si 
substrate. Box-in-box phase and amplitude 
patterns are obtained by applying a —-30 V bias 
on a 12 um by 12 um zone and then a +30 V 
bias in the 8 um by 8 um-centered one (Fig. 3, 
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Fig. 3. FE response of cross-linked P(VDF-TrFE) thin films. (A) e-T curve at 1 kHz, confirming that the cross-linked P(VDF-TrFE) maintained FE polymer 
behaviors. (B and C) P-E loops of Pt/cross-linked P(VDF-TrFE)/Pt under different electric fields at 1 kHz (B) and at different frequencies (C). (D and E) Phase (D) and 
amplitude (E) mapping of PFM. (F) Phase—voltage hysteresis and amplitude—voltage butterfly loop. 


D and E). Our results show that the polarity of 
the FE domains in the crystalline region can 
be reversibly switched by applying voltage 
rather than the injected charges, as indicated 
by scanning Kelvin probe microscopy (SKPM) 
(fig. S19). Furthermore, a piezoelectric coefficient 
(d33) of 11.3 pm/V was revealed by PFM (fig. 
$20). As exhibited by the hysteresis and butter- 
fly loops (Fig. 3F), the local piezoresponse indi- 
cates a complete FE switching. 


FE response of cross-linked P(VDF-TrFE) 
under strain 


We prepared fully elastic capacitors with a 
liquid metal (gallium, Ga) as elastic electrodes 
using a sacrificial-layer microfabrication method 
(6) (fig. S21) to test the FE response of the elastic 
polymer FE under different applied strains and 
frequencies. We present the full elastic Ga/ 
cross-linked P(VDF-TrFE)/Ga/PDMS device 
in Fig. 4A. Compared to a rigid device with 
Pt as electrodes, the P-E loops of our fully 
elastic device, without an applied strain and 
at different voltages, show more rectangular 
P-E hysteresis (fig. $22) with larger P, and 
Pymax Values. Nevertheless, we observed higher 
EF, values, which may be attributed to the dif- 
ferent thicknesses of the transition layer formed 
by different electrode materials. We observed 
the frequency-dependent behavior for the P-E 
response loops of the fully elastic device at 
different frequencies (fig. S23) that is similar to 
that of the Pt-electrode device (Fig. 3B). We 
concluded that the cross-linked P(VDF-TrFE) 
films in the fully elastic device also provide an 
equivalent FE response compared with those 


in the rigid device. We have therefore devel- 
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Fig. 4. FE response of elastic FEs under strain. (A to C) Full elastic device (A) in the stretching set-up 
under 0 (B) and 70% strain (C). (D) P-E loops at 1 kHz under strain from 0 to 70%. (E) Nominal and true P,, 
Prax, and E, under different strains (n = 5, where rn is the number of P-E loops that are used to generate 
statistical data), suggesting that the cross-linked P(VDF-TrFE) possessed a stable FE response. 


oped an effective method to prepare a fully 
elastic FE device that can be applied to other 
elastic FE materials. 

We developed the stretching set-up to apply 
the strain for the FE response under strain 
(fig. S24). After uniaxial stretching, the full 
elastic device presented a rubber-like behavior 
with a Poisson's ratio of ~0.45 (e.g., 70% axial 


strain and less than 35% transverse strain in 
our case) (Fig. 4, B and C). The electric fields 
and device areas without strain were taken as 
nominal values. We show P-E loops under 
strains ranging from 5 to 70% (figs. S25 to S32). 
As the strain increased, the degree of rectangu- 
larity of the P-E loops of the elastic FEs (Fig. 4D) 
gradually improved. At the same time, the values 
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of P, Pmax, and E, extracted from the P-E loops 
under different strains showed the same trend 
(Fig. 4E). To evaluate the effect of strain on the 
FE response, we performed a simulation to 
calculate the true values of the area (figs. S33 to 
$35), and the results were in accord with 
Eshelby's inclusion (46). The true values of P,, 
deducted by changing strain values, during 
stretching from 0 to 70% remained almost con- 
stant, which implies that the FE response of our 
elastic ferroelectrics was not affected by the 
applied strain. After the stress was removed, 
the FE response remained nearly invariable 
(fig. S36). 

We proposed a “slight cross-linking” method- 
ology to develop elastic FEs. We successfully 
prepared an elastic FE by slightly cross-linking 
plastic polymeric FEs with soft chains into a 
stable FE network, which can be produced by 
solution processing and standard electronic 
industry methods. Our elastic polymer FE 
simultaneously showed a FE response and 
elastic recovery, even under strains up to 70%. 
The slight cross-linking is an effective way out 
of the FE response-elastic recovery dilemma 
for the elastification of polymer FEs. We believe 
that the realization of intrinsically elastic FEs 
can bridge the gap between FE materials and 
emerging wearable electronics, enabling broad 
potential applications such as wearable sensing, 
information storage, and the transduction and 
storage of energy. 
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Photosensitized 0. enables intermolecular alkene 
cyclopropanation by active methylene compounds 


Dhruba P. Poudel, Amrit Pokhrel}, Raj Kumar Tak{, Majji Shankar, Ramesh Giri* 


Cyclopropanes are key features in many preclinical, clinical, and commercial drugs, as well as natural 
products. The most prolific technique for their synthesis is the metal-catalyzed reaction of an alkene 
with a diazoalkane, a highly energetic reagent requiring stringent safety precautions. Discovery of 
alternative innocuous reagents remains an ongoing challenge. Herein, we report a simple photoredox- 
catalyzed intermolecular cyclopropanation of unactivated alkenes with active methylene compounds. The 
reaction proceeds in neutral solvent under air or dioxygen (Oz) with a photoredox catalyst excited by 
blue light-emitting diode light and an iodine co-catalyst that is either added as molecular iodine or 
generated in situ from alkyl iodides. Mechanistic investigations indicate that photosensitized O. plays a 
vital role in the generation of carbon-centered radicals for both the addition of active methylene 


compounds to alkenes and the ring closure. 


yclopropanes are highly valuable and 

versatile intermediates in the synthesis 

of complex molecules and natural products 

(1-3). Cyclopropyl moieties are also exten- 

sively featured as a pivotal design element 
in preclinical and clinical drug molecules (4) to 
achieve specific therapeutic goals and are a 
key motif in biologically active molecules (5) 
and natural products (6, 7) (Fig. 1). For example, 
the recently US Food and Drug Administration- 
approved drugs nirmatrelvir (trade name Paxlovid) 
for COVID-19, mitapivat (trade name Pyrukynd) 
for hemolytic anemia, and deucravacitinib (trade 
name Sotyktu) for plaque psoriasis, along with 
14 of 200 top-selling small-molecule commer- 
cial drugs (4), including montelukast (trade 
name Singulair, etc.) for asthma, telaprevir 
(trade names Incivek and Incivo) for hepatitis 
C, and abacavir (trade name Ziagen, etc.) for 


Fig. 1. Marketed 
drugs containing 
a cyclopropyl ring, H H 


Oo tBu 


Nirmatrelvir (COVID-19) 


Telaprevir (hepatitis C) 
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HIV/AIDS (Fig. 1), contain a cyclopropyl ring 
as a key element. Cyclopropanation is therefore 
one of the most heavily investigated strained 
ring-forming reactions in organic synthesis. 
However, the carbonaceous triangle is also 
among the most difficult rings to create be- 
cause of the strain, which destabilizes the ring 
(by 27 kcal/mol) and renders it prone to open- 
ing during reactions. 

Among the limited methods available for 
intermolecular cyclopropanation (8), the most 
preferred and reliable is carbene insertion into 
alkenes (9-17). This method is profoundly depen- 
dent on reactive carbene and metal-carbenoid 
intermediates [[M]=CR,], in particular the 
donor-acceptor Rh-carbenoids (9, 10, 12) 
derived from highly energetic diazoalkanes, 
which require stringent precautions for handl- 
ing (13, 14). The metal-carbenoids are typically 


Singulair (asthma) 
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PhCHNg, and a-diazocarbonyl compound. 
decomposition in reaction with transition-metal 
[M] catalysts (8, 15, 16), and more recently under 
photoredox (77, 18) and engineered enzymatic 
(19-22) conditions. The classic Simmons-Smith 
and related reactions (23) also transform al- 
kenes into cyclopropanes, yet, again, these reac- 
tions require a large excess of unstable, often 
difficult-to-access 1,1- and 1,2-dihaloalkanes and 
highly reactive Et.Zn or Zn/Cu reagents to 
generate an adequate quantity of zinc carbe- 
noids for satisfactory reactivity. Recently, 
catalytic methods have emerged to simplify 
cyclopropanation of alkenes with alternate 
precursors (24). However, these processes 
also depend on the generation and exploita- 
tion of [M]=CR, intermediates (25-28) and 
multistep synthetic manipulations to arrive at 
reactive [M]=CR, intermediates in situ. Alter- 
natively, the noncarbenoid approaches, such 
as the Johnson-Corey-Chaykovsky reaction (29) 
through a Michael addition-initiated ring clo- 
sure (30, 31), require careful molecular pre- 
functionalization to install a leaving group at a 
precise reaction site either on Michael acceptors 
(a,B-unsaturated carbonyls) or Michael donors, 
as well as a strong base, such as LDA, KHMDS, 
NaNH, or BulLi, to enolize the Michael donors 
for sufficient nucleophilicity. Additionally, the 
Kulinkovich reaction (32-34), which converts 
terminal alkenes into cyclopropyl alcohols and 
amines, suffers fundamentally from functional 
group incompatibility and 1,2-addition reac- 
tions because of the use of superstoichiometric 
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Fig. 2. Hypothesis and reaction parameter optimization. Reactions were conducted in 0.10 mmol scale with 0.20 mmol diethyl malonate, unless stated 
otherwise, in 0.50 ml of DMF at 440 nm blue LED (36-watt Kessil lamp at 100% intensity) and ambient temperature (~35 to 40°C controlled by fans) in 1-dram 
capped glass vials. Yields were determined by GC with trimethoxybenzene as a standard. 802, 5% O2/No, and air were supplied through a balloon. 87% yield refers to the 


use of 10% O2/No. 


quantities of titanium salts and alkyl Grignard 
reagents to generate titanacyclopropanes in 
situ as cyclopropanating reagents. 

Evidently, the catalytic intermolecular con- 
struction of cyclopropyl rings on unactivated 
alkenes with simple, readily available, and bench- 
stable chemicals in an operationally straight- 
forward and broadly applicable fashion is an 
ongoing challenge (35-39). Herein, we disclose 
a simple photoredox approach for the inter- 
molecular cyclopropanation of unactivated 
alkenes directly with active methylene com- 
pounds through cooperativity of photosen- 
tized dioxygen (Oz) and alkyl halide through 
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an electron relay system (Fig. 2). This method 
offers a convenient protocol with bench-stable 
chemicals under neutral reaction conditions 
in air and O, without requiring special safety 
and sensitivity precautions. In addition, because 
hundreds of active methylene compounds with 
varied functional substituents are commer- 
cially available and an almost unlimited num- 
ber of their analogs can be readily synthesized 
and stored for prolonged periods for use on 
demand, the current reaction provides a sim- 
ple, practical, and broad-scope method for the 
intermolecular cyclopropanation of unactivated 
alkenes. 
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Reaction development 

The direct cyclopropanation of alkenes with 
active methylene compounds requires the re- 
moval of two protons and two electrons (or H) 
through redox chemistry (Fig. 2). We envi- 
sioned that such a process could be mediated 
by an excited-state photocatalyst (PC*) through 
stepwise electron transfer (ET) and the genera- 
tion of a-carbon-centered radicals. Because the 
direct ET reduction of an active methylene 
group to remove a hydride might not be viable 
at an acidic a-carbon center from a thermody- 
namic standpoint, we opted for the intermediacy 
of photoexcited oxidants to generate high-energy 
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Fig. 3. Proposed catalytic cycle with supporting details for the mechanism by Iz generation and detection, product profile, and light on-off experiment. 


radicals that could function as relays for o-H 
abstraction. Accordingly, we initially examined 
a range of common peroxide oxidants with 
1,2,3,5-tetrakis(carbazol-9-yl)-4,6-dicyanobenzene 
(4CzIPN) as a PC under a blue light-emitting 
diode (LED) light to generate intermediary 
radicals that might abstract an a-H atom from 
an active methylene (Fig. 2A) (entries 1-6) (40, 41). 
When 4-phenylbutene and diethyl malonate 
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were used as reactants, di-tert-butyl peroxide 
and pyridinium N-oxide reacted through a 
hydroalkylation pathway with no cyclopro- 
panation products observed (entries 1 and 2). 
An initial attempt to generate a superoxide 
anion by oxygen photosensitization (42) also 
did not produce any cyclopropanated product 
(entry 5). However, we discovered that a com- 
bination of peroxide oxidants and alkyl iodides, 
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PUBS: SREB 


such as cyclohexyl iodide (cHex-I), began to 
furnish the cyclopropanated product 4 in ob- 
servable amounts (entries 7 to 10). When O. 
was combined with cHex-I, the photocatalytic 
reaction furnished the cyclopropanated prod- 
uct 4 in quantitative yield in 3 hours (entry 11). 
The formation of the cyclopropyl ring was 
unambiguously established by a single crystal 
x-ray diffraction analysis of hydrolyzed product 
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Fig. 4. Catalytic reaction with lz and further mechanistic studies probing the potential involvement of carbenes and a-iodocarbonyls as reaction 


intermediates. 


4 as dicarboxylic acid. Control experiments 
under Nz and in the absence of alkyl iodide 
furnished no product (entries 5 and 12), indi- 
cating that O, and alkyl iodide are critical in the 
success of the reaction. 
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Further studies indicated that the reaction 
could be performed simply in air and even with 
only 10% O, in nitrogen (entry 11). Reactions 
could be conducted with 0.10 mol % (1000 ppm) 
of the PC at small scale (0.10 mmol) to furnish 
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the product in quantitative yield in 3 hours. 
Large-scale (10 mmol) reactions could also be 
performed with 1000 ppm PC under the current 
reaction conditions, albeit for a longer reaction 
time and with a decrease in the yield (82%, 
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Fig. 5. General scope of active methylene compounds. Reactions were conducted in 1.0 mmol scale, unless stated otherwise, in 5.0 ml of DMF at 440 nm blue 
LED (36-watt Kessil lamp at 100% intensity) and ambient temperature (~35 to 40°C controlled by fans) under Oz in 6-dram capped glass vials. Reported yields are for 
isolated products. dr was determined by proton nuclear magnetic resonance (‘H NMR). Yields in parentheses are for 0.10 mmol scale with 5 mol % Iz. Under 


air for 3 hours. #Under O» for 3 hours. 


2.38 g). Except for tert-alkyl iodide, other alkyl 
iodides such as n-butyl iodide, 7-butyl iodide, 
and cyclopentyl iodide could be used instead 
of cyclohexyl iodide (Fig. 2B1). Alkyl bromides 
and chlorides were not effective. The reaction 
proceeded most effectively in N,N’-dimethyl- 
formamide (DMF) or DMF plus dioxane (1:4); 
other polar and midpolar solvents were either 
not effective or generated the cyclopropyl prod- 
uct 4 in low yields (Fig. 2B2). Next, we examined 
different organic and organometallic PCs with 
a range of oxidation potentials. Among several 
common PCs examined (Fig. 2B3), eosin Y 
sodium salt (11), [Ir(dtbbpy)(ppy).][PF.6] (20) 
and [Ir(dFCF3ppy))o(bpy)]PF, (21) furnished 
the product 4 in high yields, and Ir(ppy)s3 (19) 
was moderately effective. 
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Mechanistic studies 

Prompted by the observed roles of alkyl iodides 
and O, for catalytic turnover, we ventured to 
further study the reaction mechanism. Analysis 
of solutions of PC, alkyl iodide, and 4-phenyl- 
butene (1) in DMF in different combinations 
by ultraviolet-visible (UV-Vis) and fluorescence 
spectroscopies indicated that there was no inter- 
action of alkyl iodide with either the PC or 
alkene 1 (see the supplementary materials for 
details). Independent reactions conducted with 
dodecyl and hexadecyl iodides under the stan- 
dard catalytic conditions confirmed the forma- 
tion alkyl formates (22, 69%; 23, 72%) generated 
from the reaction of alkyl radicals with O, in 
DMF along with the cyclopropyl product 4 
and unreacted alkyl iodides in 78 to 85% and 22 
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to 23%, respectively (Fig. 3A). Because reaction 
solutions usually turned reddish brown during 
the reaction, we further analyzed reaction mix- 
tures by UV-Vis spectroscopy and confirmed 
the generation of I, (Fig. 3B). These experi- 
ments suggested that alkyl radicals are not 
involved in the reduction of PC radical cations 
for catalytic turnover. Rather, iodide (I) or 
iodate (IO; ), generated from the hydrolysis or 
the oxidation of I,, furnish the required elec- 
trons to reduce the intermediary PC radical 
cations (43, 44). To further confirm the role of Is, 
we conducted the standard catalytic reaction 
by replacing cHex-I with 5 mol % I,. The reaction 
generated the cyclopropyl product 4 in 99%, 
89%, and 83% yields under Ong, air, and 10% O4 
in No, respectively, in 3 hours (Fig. 4A). 


5 of 9 


RESEARCH | RESEARCH ARTICLE 


Terminal alkenes 
CO,Et 
me ~ Ay oor eee ee ee pia ; CORES 
CO,Et CO,Et COzEt CO,Et 
55, 80% 56, 92% 57, 82% 58, 93% 
CO.Et fe) 
cOaFt o Ay : PO ee ae sei 
Oh aA j Co,Et Br ~~ ‘ S co,et 
CO,Et 2 CO2Et CO2Et 
ie) CO2Et 
59, 84% (71%),8 (73%)* 60, 81% 61, 69% 62, 42% (dr, 1.3:1), (35%)* 63, 68% 
H Br 
9 CO,Et Eto,C ea Neo CO2Et 5 CO,Et 
EtO-P. BO ae YT 
etd CO:Et see CO2Et ra} CO2Et CO,Et 
2 CO.Et Me Br Br 
64, 75% 65, 62% 66, 63% 67, 65% 68, 48% 
MeO. 
CO,Et CO-Et 
COLEt an a ‘ ig o AY ; Eto OOO 4 Sart 
CO.Et EtO.C CO.Et 2 
CO,Et cs 2 CO,Et 
69, 84% 70, 77% (64%)" 71, ei (dr, 1:1) 72, 94% 
Tnternal alkenes unreactive alkenes 
Me Cr on O2Et Oe CO2Et oO O2Et  styrenes Cr 
AA CO2Et oe CO,Et Ux. CO,Et 
CO,Et ee CO,Et Cie 
73, 64% (dr, 1.5:1) 74, 67% (dr, 1:1) (49%)8 75, 72% 76, 68% (55%)* 77,61% (dr, 1.1:1) 78, 70% ices 
: ou Me, 
Drugs and natural products Me ° “, rs 
EtO.¢ SC 
Eto.c EtO,C Eto,c (o) (o) cc 
EtO.C EtO,C 


79, 75%(dr, 1.1:1) (92%),8 (90%)* 
(from Estrone) 


Me 


82, 67% (dr, 1.1:1) 
(from a -Tocopherol - Vitamin E) 


EtO.C. co, €t 


85, 49% (52%),§ (50%)* 
(from Indomethacin) 


86, 75% 
(from Ketorolac) 


EtO,C__CO,Et 


H 
HO. N 
eO. NN 
| sa 
N 


89, 77% (dr, 2:1) 
(from Quinine) 


Eto,c 


4 


90, 79% (dr, 1.9:1) (72%)* 
(from Cinchonidine) 


CO,Et 


Poudel et al., Science 381, 545-553 (2023) 4 August 2023 


80, 88% (dr, 1.5:1) 
(from Dihydrocholesterol) 


fo) 
Eto, on 
EtO,C Se | y 
Me 
83, 54% 


(from Theobromine) 


87, 43% 
(from Penicillin G) 


Me if OMe VV ~COzEt 


91, 75% (dr, 2.5:1.6:1.3:1) 
(from (+)-Linalool oxide) 


CN Me 


81, 74% (dr, 1.2:1) 
(from 7-Hydroxyflavonone) 


CH, os 


84, 89% (dr, 1.1:1) 
(from Loxoprofen) 


CO.Et 


EtO,C. CO,Et 


88, 66% (dr, 1.4:1) 
(from Ethyl oleate) 


Et0,C~ ~CO,Et 


92, 78% (dr, 1:1) 
(from Dihydromyrcenol) 


6 of 9 


RESEARCH | RESEARCH ARTICLE 


Fig. 6. General scope on alkenes, complex molecules, and functional group compatibility. Reactions were conducted in 1.0 mmol scale, unless stated 
otherwise, in 5.0 ml of DMF at 440 nm blue LED (36-watt Kessil lamp at 100% intensity) and ambient temperature (~35 to 40°C controlled by fans) under Oz in 
6-dram capped glass vials. Reported yields are for isolated products. dr was determined by 'H NMR. Yields in parentheses are for 0.10 mmol scale with 5 mol % lo. 


§Under air for 3 hours. #Under O2 for 3 hours. 


We propose a catalytic cycle (Fig. 3C) for the 
cyclopropanation reaction that is based on the 
above findings and a series of additional ex- 
periments and characterization of 13 different 
products supporting the existence of various 
catalytic intermediates as outlined in Fig. 3D. 
We anticipate that O, is first reduced by pho- 
toexcited PC* to superoxide ion (O.” ), which 
abstracts a-H from active methylene compounds 
to generate a-C radicals. We have confirmed the 
formation of these a-C radicals by the isolation 
of the saturated dimer 24 and alkene dimer 25 
as side products from catalytic reactions. Because 
both the cyclopropyl product 4 and the alkene 
dimer 25 could also arise from carbene inter- 
mediates, we designed an intramolecular com- 
petition experiment in which diethyl o-allyl 
malonate bearing a methine (a-C-H) group 
was exposed under the standard conditions 
to di-tert-butyl malonate containing a methy- 
lene (a-CH.) group (Fig. 4B). The reaction 
yielded a 2:3 mixture of the expected cyclo- 
propyl product 26, along with another cyclo- 
propyl product (27) derived from the innate 
methine group, indicating that carbene inter- 
mediates are not generated in the reaction. In 
the absence of alkene, the a-C radicals undergo 
radical dimerization, followed by further oxi- 
dation to generate mono- and dihydroxylated 
products (31 and 32). Additionally, when a 
reaction of diethyl a-allyl malonate with diethyl 
malonate was allowed to continue, o-hydroxy- 
lated cyclopropyl product 33 was generated, 
suggesting that the superoxide ion (O.” ) or 
the radical anions are capable of abstracting 
a-H from active methylene compounds such as 
in Int-1, Int-3, and Int-4. 

The o-C radicals then most likely undergo 
addition to alkenes to form secondary C radicals, 
which subsequently react with O. to generate 
peroxy radical anions. The radical addition 
steps for the formation of Int-2 and Int-3 are 
supported by the isolation of hydroalkylation 
product 5 from the standard reaction under 
reduced O, concentration (air) and a hydroxyl- 
actone (34) through deaminative oxidation when 
malonamide was used. In addition, a reaction 
of indene with diethyl malonate generated an 
oxo-product (35), further confirming the exis- 
tence of Int-3 in solution. The radical anions 
(Int-3) then abstract the intramolecular o-H 
to generate o-C radicals, which then subse- 
quently undergo radical 1,3 substitution with 
the peroxide to create the cyclopropyl ring. An 
on-off experiment in which the product was 
formed only when the reaction was exposed to 
the blue LED light (Fig. 3E) rendered an effi- 
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cient radical chain propagation mechanism 
unlikely. 

Because our reaction protocol involves cata- 
lytic In, we also investigated other mechanistic 
possibilities by which photoredox catalysis 
operates for cyclopropanation with halogenated 
carbon sources (45). Photoredox alkene cyclo- 
propanation reactions with halogenated car- 
bon sources have been previously proposed 
to proceed by atom transfer radical addition 
(ATRA) (46, 47). In particular, Carreira and 
co-workers (39) recently reported a photoredox 
benzothiazinoquinoxaline-catalyzed cyclopropana- 
tion of alkenes with o-bromo-f-ketoesters and 
a-bromomalonates through a radical ATRA 
followed by a base-promoted intramolecular 
1,3-Sy2 process. Tokuyama and co-workers (48) 
reported a photocatalytic intramolecular cyclo- 
propanation of a-bromodicarbonyls with a 
pendant alkene, which proceeded by an ATRA/ 
1,3-Sy2 sequence. Molander and co-workers (27) 
observed a radical polar crossover mechanism in 
photoredox alkene cyclopropanation with an 
iodomethylsilicate. In addition, Suero and Herraiz 
(49) reported cyclopropanation of alkenylarenes 
with an o,a-diiodocarboxylate, a reagent that 
generates o-iodo-a-C radicals with potential for 
cyclopropane ring closure by carbeniod inter- 
mediates and an 1,3-Sy2 displacement. Cyclo- 
propanation by 1,3-S,2 substitution requires 
a base for the generation of a carbon nucleo- 
phile by deprotonation. By contrast, bases are 
detrimental to the cyclopropanation reaction 
under our current conditions. For example, 2,6- 
lutidine, a base critical for cyclopropanation by 
1,3-Sy2 substitution (39), reduced the yield of 
the cyclopropanated product 4 by 35% (Fig. 
4C). 4-Dimethylaminopyridine and pyridine 
also remained similarly detrimental, where- 
as Et.NH, Et,N, and 7Pr.NH completely sup- 
pressed the cyclopropanation reaction. Analysis 
of our catalytic reaction mixtures by gas chro- 
matography (GC), GC-mass spectrometry, and 
liquid chromatography-mass spectrometry 
indicated no formation of o-iodomalonate or 
a,a-diiodomalonate as intermediates. A con- 
trol experiment attempting reaction of 0.5 
equivalents of I, with diethyl malonate under 
our catalytic conditions in the absence of an 
alkene also showed no formation of a-iodo- 
and a,a-diiodomalonates, and unreacted diethyl 
malonate was recovered (75%) (Fig. 4D). The use 
of a-iodomalonate as a reactant also failed to 
generate cyclopropyl product 4 under the stan- 
dard conditions. Rather, the reaction formed a 
1,2-iodoformylated product (Fig. 4E). A similar 
control experiment with a 1,3-diketone (diben- 
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zoylmethane) in the presence of I, without an 
alkene generated a-iododiketone in 31% yield, 
along with a 1,2,3-trione (14%) and unreacted 
1,3-diketone (48%) (Fig. 4F). However, the reac- 
tions of a-iododiketone alone or of a mixture of 
a-iododiketone and a,o-diiododiketone (2:3 
ratio) under the standard reaction conditions 
generated not the cyclopropanated product 4, 
but rather the 1,2,3-trione and the 1,2-iodofor- 
mylated products, further suggesting that any for- 
mation of a-iododiketone and o,c-diiododiketone 
intermediates would lead to undesired products 
rather than cyclopropanation (Fig. 4, G and H). 
These experiments unambiguously confirm 
that a-iodo- and o,,c-diiodocarbonyls are neither 
substrates nor reaction intermediates in the 
current cyclopropanation and that the reaction 
most likely operates by a photoexcited O.- 
mediated, PC*/I,-catalyzed pathway in which 
O, enables both the generation of malonate 
a-C radicals and the interception of secondary 
C radicals to mediate ring closure. 


Substrate scope exploration 


With optimized reaction conditions, we explored 
the scope of the cyclopropanation reaction with 
regard to the active methylene compounds in 
the presence of both stoichiometric cHex-I and 
catalytic I, and in air and O, (Fig. 5). The reaction 
under both catalytic protocols demonstrated 
a broad scope of 19 methylene compounds in 
five different clusters from the standpoint of 
synthetic compatibility and generated a suite 
of 1,1-dicarbonylcyclopropanes. Initially, we 
examined the reactivity of the most common 
active methylene compounds bearing esters, 
ketones, and nitriles with the same or differ- 
ent functionality on the two termini (36 to 45) 
(Fig. 5A). The reactions with methylenes bearing 
different functionalities, such as cyanoesters, 
ketoesters, and ketonitriles (40 to 45), pro- 
ceeded with moderate to good diastereoselec- 
tivity to generate trisubstituted cyclopropanes 
(dr, up to 4:1). The diastereoselectivity could also 
be increased (1.6:1 to 3.8:1) by replacing Et in 
44: with t-Bu in 45. The reaction also displayed 
high efficacy with sulfonylated methylenes and 
afforded the trisubstituted sulfonylcyclopro- 
pane products (46 and 47) in high yields and 
with good diastereoselectivity (Fig. 5B). An x-ray 
diffraction analysis of the hydrolyzed product of 
the major disastereomer of sulfonyl cyclo- 
propane #7 revealed that the larger groups, the 
sulfonyl and the alkyl groups, were disposed 
in an anti-fashion along the cyclopropyl plane, 
suggesting steric control of diastereoselec- 
tivity. The reaction was also compatible with 
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introduction of carbonylated heterocycles on 
cyclopropyl cores, as demonstrated by the for- 
mation of products from active methylenes 
bearing furanyl, thiophenyl, and pyridyl rings, 
along with esters (48 to 50) with moderate to 
good diastereoselectivity (Fig. 5C). The method 
also tolerates some of the most sensitive func- 
tional groups. For example, isocyanate (Fig. 5D) 
can be readily introduced into cyclopropane 
rings (51). Cyclopropyl isocyanate is an im- 
portant building block in pharmaceutical and 
medicinal chemistry for the production of 
cyclopropyl amides [e.g., the anticancer drug 
lenvatinib (trade name Lenvima) (50)] and cyclo- 
propyl urea analogs (57, 52) [NPR-A agonist 
(53)]. Currently, this building block is synthe- 
sized in three steps commencing with the cyclo- 
propanation of acrylic esters by diazomethane. 
In addition, the isocyanate cyclopropane 51 could 
be readily hydrolyzed in quantitative yield to ob- 
tain cyclopropyl a-amino acid 52, another im- 
portant building block that requires multistep 
synthesis (54-56). Primary and secondary amides 
containing active hydrogens are also compatible 
with the cyclopropanation method (Fig. 5E) to 
generate cyclopropyl carboxamides (53 and 54) 
in excellent yields with moderate diastereo- 
selectivity. Our protocol offers a direct and con- 
venient route to cyclopropyl carboxamides, which 
are currently synthesized in two to four steps from 
diazo- or dihaloalkanes (57, 58). The products 
of the current process, such as 1,1-dicarbonyl- and 
1-carbonyl-1-sulfonylcyclopropanes, are known to 
function as inhibitors against a range of bi- 
ological targets, including B-site APP cleaving 
enzyme (BACE) inhibitors for Alzheimer's dis- 
ease (59-672), and therefore represent high- 
profile design elements in drug discovery. 
Next, we examined the scope of the cyclo- 
propanation reaction on alkenes, taking diethyl 
malonate as a representative active methylene 
compound (Fig. 6). The reaction proceeds effi- 
ciently with terminal alkenes containing acyclic 
and cyclic alkyl and aryl backbones and toler- 
ates various functional groups, such as ester, 
carbonate, carbamate, epoxide, ether, alkyne, 
alkyl bromide, and aryl bromide (55 to 72). 
Moreover, alkenes bearing active hydrogens on 
a-carbons, such as phosphates (64) and malo- 
nate esters (65), and on amide nitrogen, such 
as secondary carbamates (66 and 67), are also 
excellent substrates for cyclopropanation. The 
reaction can be controlled for monocyclopro- 
panation of substrates containing two alkenes 
(70) and is also applicable for dicyclopropa- 
nation of both alkenes (71), further highlighting 
the synthetic utility for selective cyclopropana- 
tion. This method also works well for the cyclo- 
propanation of more sterically challenging linear 
and cyclic disubstituted internal alkenes, which 
generate tetrasubstituted cyclopropyl products 
(73 to '78) (62). In general, cHex-I was higher 
yielding than catalytic I, when used for complex 
substrates, potentially because of the slow 
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release of catalytic I, by cHex-I. The current 
reaction condition was not effective for the cyclo- 
propanation of styrenes and dienes. Although 
dienes remained unreactive, styrenes generated 
2-alkylated acetophenones such as diethyl 2- 
(indanonyl)malonate 35 (Fig. 3C). 

We also examined the scope of the reaction 
with respect to complex pharmaceuticals and 
natural products bearing alkenes (Fig. 6). The 
alkenes in the O-allylated steroids estrone (79) 
and dihydrocholesterol (80) and O-allylated 7- 
hydroxyflavonone (81) and a-tocopherol (vitamin 
E) (82) generated cyclopropanated products in 
good to excellent yields. The N-allylated al- 
kaloid heterocycle theobromine (83) and the O- 
allylated nonsteroidal anti-inflammatory drugs 
(NSAIDs) loxoprofen (84) and indomethacin 
(85) were similarly cyclopropanated with diethyl 
malonate. The C- and O-diallylated NSAID 
ketorolac (86) was efficiently converted to a prod- 
uct with two trisubstituted cyclopropyl rings. 
Drugs such as penicillin G (87), which contain 
acidic NH and a sensitive B-lactam ring, were 
also compatible. The naturally occurring inter- 
nal alkene in the fatty acid ester ethyl oleate (88) 
also furnished tetrasubstituted cyclopropyl prod- 
uct in good yield. Additional innate alkenes in 
aliphatic and heterocyclic natural products bear- 
ing a free hydroxy group, such as the alkaloids 
quinine (89) and cinchonidine (90) and the 
terpenes linalool oxide (91) and dihydromyrcenol 
(92), underwent efficient cyclopropanation with- 
out interference from the unprotected alcohols 
present in the molecular backbones. 

We anticipate that further development of the 
current photoredox protocol has the potential to 
transform the way that alkenes are cyclopropa- 
nated. In particular, discovery of new param- 
eters that can overcome current limitations for 
additional general classes of alkenes, such as 
styrenes and 1,3-dienes, and their adaptation in 
a flow system could open new avenues for 
implementation in large-scale process chemis- 
try in the future. 


REFERENCES AND NOTES 


1. H.M.L. Davies, J. R. Denton, Chem. Soc. Rev. 38, 3061-3071 
(2009). 

2. C.A. Carson, M. A. Kerr, Chem. Soc. Rev. 38, 3051-3060 
(2009). 

3. D. Zhang, H. Song, Y. Qin, Acc. Chem. Res. 44, 447-457 
(2011). 

4. T.T. Talele, J. Med. Chem. 59, 8712-8756 (2016). 

5. J. Salatin, in Small Ring Compounds in Organic Synthesis VI, 
A. de Meijere, Ed. (Springer, 2000), vol. 207 of Topics in 
Current Chemistry, p. 1-67. 

6. C. Ebner, E. M. Carreira, Chem. Rev. 117, 11651-11679 
(2017). 

7. W. A. Donaldson, Tetrahedron 57, 8589-8627 (2001). 

8. H. Lebel, J.-F. Marcoux, C. Molinaro, A. B. Charette, Chem. Rev. 
103, 977-1050 (2003). 

9. M. P. Doyle, in Modern Rhodium-Catalyzed Organic Reactions, 
P. A. Evans, Ed. (Wiley, 2005), p. 341-355. 

10. H. M. L. Davies, E. G. Antoulinakis, “Intermolecular metal- 
catalyzed carbenoid cyclopropanations” (Wiley, 2001), 
vol. 59 of Organic Reactions; https://doi.org/10.1002/ 
0471264180.or057.01. 

1. J. Rancourt et al., J. Med. Chem. 47, 2511-2522 (2004). 

12. F. P. Cald, A. Zimmer, G. Bistoni, A. Furstner, J. Am. Chem. Soc. 
144, 7465-7478 (2022). 


4 August 2023 


ot) 


BR 


co 


22. 


28. 


43. 


oO 
2 


56. 
57. 


. S. P. Green et al., Org. Process Res. Dev. 24, 67-84 


(2020). 


. S. P. Lathrop et al., Org. Process Res. Dev. 27, 90-104 


(2023). 


. B. Morandi, E. M. Carreira, Science 335, 1471-1474 (2012). 
. S. Zhu, J. A. Perman, X. P. Zhang, Angew. Chem. Int. Ed. 47, 


3460-8463 (2008). 


. Z.-L. Chen et al., Org. Lett. 24, 2232-2237 (2022). 
. M. J. R. Richter, F. J. Zécri, K. Briner, S. L. Schreiber, Angew. 


Chem. Int. Ed. 61, e202203221 (2022). 


. M. Bordeaux, V. Tyagi, R. Fasan, Angew. Chem. Int. Ed. 54, 


1744-1748 (2015). 


. D. Nam, V. Steck, R. J. Potenzino, R. Fasan, J. Am. Chem. Soc. 


143, 2221-2231 (2021). 


. B. J. Wittmann et al., ACS Catal. 10, 7112-7116 (2020). 


P. S. Coelho, E. M. Brustad, A. Kannan, F. H. Arnold, Science 
339, 307-310 (2013). 

A. B. Charette, A. Beauchemin, “Simmons-Smith 
cyclopropanation reaction” (Wiley, 2001), vol. 58 of Organic 
Reactions; https://doi.org/10.1002/0471264180.or058.01. 


. C. Jing, B. T. Jones, R. J. Adams, J. F. Bower, J. Am. Chem. 


Soc. 144, 16749-16754 (2022). 


. S. Sakurai, T. Inagaki, T. Kodama, M. Yamanaka, M. Tobisu, 


J. Am. Chem. Soc. 144, 1099-1105 (2022). 


. L. Zhang, B. M. DeMuynck, A. N. Paneque, J. E. Rutherford, 


D. A. Nagib, Science 377, 649-654 (2022). 


. J. P. Phelan et al., J. Am. Chem. Soc. 140, 8037-8047 


(2018). 

B. Xu et al., J. Am. Chem. Soc. 142, 6206-6215 (2020). 
A.-H. Li, L.-X. Dai, V. K. Aggarwal, Chem. Rev. 97, 2341-2372 
(1997). 


. J. D. Johnson, C. R. Teeples, N. R. Akkawi, S. M. Wilkerson-Hill, 


J. Am. Chem. Soc. 144, 14471-14476 (2022). 


. M-S. Liu, H.-W. Du, J.-F. Cui, W. Shu, Angew. Chem. Int. Ed. 61, 


€202209929 (2022). 


. O. G. Kulinkovich, A. de Meijere, Chem. Rev. 100, 2789-2834 


(2000). 


. E. J. Corey, S. A. Rao, M. C. Noe, J. Am. Chem. Soc. 116, 


9345-9346 (1994). 


. J. Ni, X. Xia, W.-F. Zheng, Z. Wang, J. Am. Chem. Soc. 144, 


7889-7900 (2022). 


. R. W. Coscia, T. H. Lambert, J. Am. Chem. Soc. 131, 


2496-2498 (2009). 


. H. Oumar-Mahamat, C. Moustrou, J. M. Surzur, M. P. Bertrand, 


J. Org. Chem. 54, 5684-5688 (1989). 


. D. Yang, Q. Gao, C.-S. Lee, K.-K. Cheung, Org. Lett. 4, 


3271-3274 (2002). 


. L-H. Jie, B. Guo, J. Song, H.-C. Xu, J. Am. Chem. Soc. 144, 


2343-2350 (2022). 
D. M. Fischer, H. Lindner, W. M. Amberg, E. M. Carreira, J. Am. 
Chem. Soc. 145, 774-780 (2023). 


. C. K. Prier, D. A. Rankic, D. W. C. MacMillan, Chem. Rev. 113, 


5322-5363 (2013). 


. J. M.R. Narayanam, C. R. J. Stephenson, Chem. Soc. Rev. 40, 


102-113 (2011). 


. D. C. Fabry, M. Rueping, Acc. Chem. Res. 49, 1969-1979 


(2016). 
G. Schmitz, Phys. Chem. Chem. Phys. 3, 4741-4746 (2001). 


. P. Zhang, C. C. Le, D. W. C. MacMillan, J. Am. Chem. Soc. 138, 


3084-8087 (2016). 


. Z.-L. Chen, Y. Xie, J. Xuan, Eur. J. Org. Chem. 2022, 


€202201066 (2022). 


. C.-J. Wallentin, J. D. Nguyen, P. Finkbeiner, C. R. J. Stephenson, 


J. Am. Chem. Soc. 134, 8875-8884 (2012). 


. M.A. Ischay, Z. Lu, T. P. Yoon, J. Am. Chem. Soc. 132, 


8572-8574 (2010). 


. K. Ide, M. Furuta, H. Tokuyama, Org. Biomol. Chem. 19, 


9172-9176 (2021). 


. A. G. Herraiz, M. G. Suero, Chem. Sci. 10, 9374-9379 


(2019). 


. P. Shiri, S. Ramezanpour, A. M. Amani, W. Dehaen, Mol. Divers. 


26, 2981-3002 (2022). 


. J. Cui et al., Proc. Natl. Acad. Sci. U.S.A. 108, 6763-6768 


(2011). 


. S. Huang, R. Li, K. R. LaMontagne, L. M. Greenberger, 


P. J. Connolly, Bioorg. Med. Chem. Lett. 21, 1815-1818 
(2011). 
T. lwaki et al., Bioorg. Med. Chem. 25, 6680-6694 (2017). 


. W. C. Cindy Lee et al., Chem 7, 1588-1601 (2021). 
. T. Y. Yuen, C. J. Brown, Y. S. Tan, C. W. Johannes, J. Org. 


Chem. 85, 1556-1566 (2020). 
F. Brackmann, A. de Meijere, Chem. Rev. 107, 4493-4537 (2007). 
D. Zhang et al., Synthesis 44, 705-710 (2012). 


8 of 9 


RESEARCH | RESEARCH ARTICLE 


58. H. Schubert, M. Rack, M. Mihlstadt, J. Prakt. Chem. 332, 
812-814 (1990). 

59. R. K. Banerjee, B. Dave, R. Hangarge, R. Wetal, V. P. Palle, 
R. K. Kamboj, Preparation of pyridopyrimidine compounds as 
MEK inhibitors for the treatment of diseases, WIPO patent 
W02016009306 Al (2023). 

60. B. Bertani, R. Di Fabio, F. Micheli, G. Tedesco, 
S. Terreni, Preparation of aza-bicyclic compounds as inhibitors 
of mono-amines re-uptake and antidepressant agents, WIPO 
patent WO2008031772 Al (2023). 

61. W.-L. Wu, D. A. Burnett, A. Stamford, J. N. Cumming, 
T. Asberom, C. Bennett, T. K. Sasiskumar, J. D. Scott, 
Preparation of spiroiminothiazine derivatives for use as 
BACE inhibitors, WIPO patent WO2013028670 Al 
(2023). 

62. 1,1-Disubstituted alkenes generated Heck type products rather 
than cyclopropyl rings. Alkynes generate cyclopropenes in 
less than 10% yields. 


Poudel et al., Science 381, 545-553 (2023) 


ACKNOWLEDGMENTS 


We thank H. Yennawar at the Pennsylvania State University for the 
x-ray crystallographic analysis of compounds 4, 32, and 37. The 
chemistry described herein has been submitted for a patent (US 
application no. 63/488,046). Funding: This work was supported by 
the National Institute of General Medical Sciences of the National 
Institutes of Health (NIH) (grant R35GM133438) and The 
Pennsylvania State University. The x-ray instrument was funded by 
the NIH (SIG S10 grants 1S100D028589-01 and 1S10RR023439- 
01). Author contributions: R.G. conceived of and directed the 
project and prepared the manuscript. R.G. and D.P.P. designed the 
experiments. D.P.P. performed experiments. D.P.P., A.P., R.K.T., 
and M.S. collected and analyzed data. Competing interests: 

The authors declare no competing interests. Data and materials 
availability: The supporting data for the findings of this study are 
available within the main manuscript and the supplementary 
materials. The x-ray crystallographic data for compound 4 (deposition 
no. 2224825), and hydrolyzed products of compounds 32 (deposition 


4 August 2023 


no. 2224826) and 37 (deposition no. 2224828) are available from 
the Cambridge Crystallographic Data Center (CCDC). License 
information: Copyright © 2023 the authors, some rights reserved; 
exclusive licensee American Association for the Advancement of 
Science. No claim to original US government works. https://www. 
science.org/about/science-licenses-journal-article-reuse 


SUPPLEMENTARY MATERIALS 
science.org/doi/10.1126/science.adg3209 
Materials and Methods 

Figs. Sl to S23 

Tables S1 to S33 

NMR Spectra 

References (63-71) 


Submitted 15 December 2022; resubmitted 22 March 2023 
Accepted 26 June 2023 
10.1126/science.adg3209 


9 of 9 


RESEARCH 


® 


GEOPHYSICS suggested aseismic expression of these f Check for 


Tremor signals during fluid injection are generated 


by fault slip 


Shankho Niyogi?*, Abhijit Ghosh’, Abhash Kumar’, Richard W. Hammack? 


Seismic tremor signals, also known as long-period, long-duration signals, have been reported in 
several locations where fluid injection for enhanced oil and gas exploration is taking place. However, 
the origin of these signals remains poorly constrained. We studied seismic tremor signals in Wellington 
Field, Kansas, using a seismic array during a carbon dioxide injection program. We show that these 
signals are generated below the surface during the time of carbon dioxide injection. They have a 
distinct spectral signature, similar to those observed in glacial and volcanic environments. The tremor 
sources are located near the injection site and aligned with preexisting faults. Modeling results imply 
that such tremors are generated by frictional slip on fault. These observations may reveal an important 
deformation mode, which is useful for studying associated stress, seismicity, and triggering, 

as well as for tracking fault activities during injection operations of all fluids, including 


supercritical carbon dioxide. 


O, injection in the subsurface can be 
performed both for sequestration and 
for enhanced oil recovery purposes. It is 
a discipline that has recently witnessed 
growth because of increasing efforts to 
mitigate global warming and growing energy 
demand. When CO,j is injected into an oil reser- 
voir, it reduces the viscosity of the residual oil 
and expands its volume, facilitating extraction, 
extending the productive life of the hydrocarbon 
field, and removing the injected CO, out of 
the atmosphere. However, the injection of these 
fluids can increase the pore pressure in the 
subsurface and potentially cause critically stressed 
faults near the injection site to slip. When seismic 
events such as earthquakes are generated from 
such a mechanism, this is known as induced 
seismicity. In contrast to natural earthquakes, 
some induced seismic events can show very 
different signal characteristics. Such signals are 
typically reported as long-period, long-duration 
events in several fluid injection sites (J-6). 
Physical mechanisms of such events, their 
underlying physics, and their relation to the 
injected fluid, however, remain enigmatic. 
The waveform characteristics of these long- 
period, long-duration signals are similar to the 
tectonic tremors found in large natural faults. 
Contemporary studies report that the pres- 
ence of fluids in faults and subduction zones 
is one of the factors behind the origin of tectonic 
tremors and is largely driven by aseismic slip 
(7-9). Laboratory studies have shown that 
slow injection rates can accelerate aseismic creep 
while maintaining stable slip due to the pressur- 
ized zone being confined within a critical size for 
an unstable slip in fault gouge samples (0). 
Similarly, aseismic deformation was found to 
be the dominant response when fluid injections 
were performed in natural fault systems in the 
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Low Noise Underground Laboratory [Labora- 
toire Souterrain Bas Bruit (LSBB)] of Rustrel, 
France, to induce slip (11). We also have evi- 
dence of fluid injection-induced earthquakes 
exhibiting broader body wave pulses and lower 
frequency coda (72), and these characteristics 
have been attributed to slower rupture speeds 
and lower stress drop values, which makes them 
similar to low-frequency earthquakes with regard 
to source characteristics. 

We observed low-frequency, long-duration, 
emergent signals in the seismic monitoring sta- 
tions [Incorporated Research Institutions for 
Seismology (IRIS) network identifier: ZA] placed 
around the injection well 2-32 in Wellington 
Field, Kansas, during a supercritical CO, injec- 
tion program (6) (Fig. 1). The supercritical CO, 
was injected into the Mississippian carbonate 
reservoir in a fluid state (6). Additionally, waste- 
water was being disposed of in the deeper, 
aquitard-bounded Arbuckle Group Formation. 
On the basis of our analysis of the United States 
Geological Survey (USGS) earthquake catalogs, 
the onset of the injection program did not cause 
an increase in local earthquakes, especially around 
the injection well. Most earthquakes in the south- 
central Kansas region are caused by oil and gas 
activities such as wastewater disposal from 
both local areas and from northern Oklahoma 
caused by far-field pressure diffusion (73). The 
locations of these earthquakes show that they 
are aligned with the structural features of this 
region, like the Humboldt Fault Zone and the 
Central Kansas Uplift (Fig. 1). Historical earth- 
quakes in Kansas have also taken place on the 
faults along these features. The Mississippian 
Group of rocks and the Cambro-Ordovician 
Arbuckle Group in Wellington Field are the tar- 
get lithology for CO.-enhanced oil recovery and 
storage, respectively (14). Exploration seismic 
data show that both of these formations are con- 
nected to the basement faults in Wellington Field 
(4). Initial studies of these low-frequency events 
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t 
movements (6). We focused on the coheren cyan 


spectral signatures of these events, which make 
them similar to harmonic tremors reported earlier 
from volcanic and glacial settings. We explain 
the signals using physical models derived from 
these sources. For simplicity, we refer to these 
long-period, long-duration events as tremors. 

Investigation of unknown seismic signals 
often involves a separate analysis for looking 
into possible anthropogenic sources. We con- 
ducted independent analysis of these events 
with regard to their origin from freight trains, 
vehicular traffic, aerial sources such as helicop- 
ters, and machinery equipment such as pumps 
and generators on the surface near the seismic 
array. The presence of equipment does create 
noise, which is shown in the clearly visible parts 
of the spectrograms. However, a number of events 
with high signal-to-noise ratio clearly establishes 
the characteristics of the tremor signal. We have 
used that to make a tremor catalog based upon 
these well-defined waveform characteristics. A 
notable characteristic of these tremor events 
are the lines gliding across frequencies between 
1 and 5 Hz observed in the spectrograms. For 
many of the events displaying similar spectral 
characteristics with gliding spectral lines, slow- 
ness and back azimuth could not be robustly 
determined because of the inherent noise con- 
tent. Therefore, we did not include them in our 
final analyses. Observations of such kinds of 
low-frequency tremor events, their source lo- 
cations, and their spatiotemporal distribution 
may provide a useful and benign way of track- 
ing the migration of injected fluids in the 
subsurface. Using an array response function 
with a synthetic waveform showed that the 
array could perform reasonably well at 1 to 
5 Hz, the dominant frequency band of interest, 
and up to at least 0.3 s/km, which was suitable 
for our analysis. 


Background 


The Arbuckle Formation consists mainly of 
interlayered dolomites and carbonates and 
was deposited in a shallow epicontinental sea in 
Kansas and Oklahoma during the late Cam- 
brian to the Middle Ordovician. Karst formations 
within it created permeable zones for waste- 
water disposal, sealed by the overlying Devonian 
Woodford shale formation. The Mississippian 
limestone formations were later deposited in 
similar shallow marine environment settings 
and are bounded by unconformities. Oil deposits 
of the Mississippian formations are mostly a 
combination of structural-stratigraphic traps 
where the present-day CO, injection is going 
on. Tectonic events such as the Nemaha Uplift 
and the Midcontinent Rift event influenced the 
structure of the Mississippian and Arbuckle 
Formation, creating deep vertical faults in the 
subsurface. Subsurface formations in southern 
Kansas were strongly affected by these tectonic 
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events (15-23), and this is where most earth- 
quake activity has occurred, including an esti- 
mated magnitude 5.1 event in 1867. 

Information about subsurface structures in 
Wellington Field comes from three-dimensional 
seismic reflection data and formation micro- 
imaging (FMI) well logs (/4). Analyzing the 
seismic data shows mostly northeast and some 
northwest trending subvertical faults, whereas 
FMI data show that most of the fractures were 
oriented toward the northwest, with some in 
the northeast direction. On the basis of the trend 
of the earthquake locations, faults, and fracture 
data, the tectonic episodes that created the 
Central Kansas Uplift and the Nemaha Ridge 
Humboldt Fault Zone also affected Wellington 
Field and generated similar structures in that 
area as well. 


Observations 


We found that the long-period, long-duration 
events (tremor events) occurred when the CO, 
injection was being performed between January 
and June of 2016. Kumar et al. (6) suggested that 
the filtered waveforms of these signals bear 
visual similarity to those of tectonic tremors 
found in geologic settings under natural tec- 
tonic stresses. These events contain the highest 
energy in 1 to 5 Hz, which is part of the 1- to 12-Hz 
frequency band in which tectonic tremors are 
usually visible. The events are not visible in 
regional seismic networks in this region, sug- 
gesting their local origin. By beamforming the 
tremor signals in 1 to 5 Hz, we showed that they 
have a low slowness value (<0.3 s/km), indicat- 
ing their subsurface origin. From the initial 
catalog of Kumar et al. (6), we selected several 
events with high signal-to-noise ratios that passed 
our strict quality tests (fig. S3). Using the wave- 
form characteristics of these events, we detected 
additional events not previously recognized 
during the injection period. Most of the signals 
had durations ranging from 2 to 5 min. After 
conducting thorough quality evaluations on 
these events, we ultimately selected 27 to locate 
using the beam backprojection technique 
(24-27). The locations of the signals were deter- 
mined by assuming the depth in which the 
supercritical CO, injections took place. This 
assumption is necessary because a single array 
cannot constrain depth of the signal indepen- 
dently. All of the locations that we obtained 
were within 5 km of the injection well. The 
spatial pattern of the locations is roughly aligned 
along northeast-southwest and northwest- 
southeast directions. These trends are also 
shared by the faults of the Central Kansas Uplift 
and the Nemaha Ridge Humboldt Fault Zone 
in this region. Schwab et al. used exploration 
seismic data to show that basement faults did 
extend to the horizons where the fluids were 
being injected (14). Schwab et al. also showed the 
presence of natural open fractures in the target 
formations, as shown by the FMI data from 
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Fig. 1. Location of the network in relation to earthquake occurrences and other tectonic structures in 
Kansas. (A) Location of the ZA network in Wellington Field. We analyzed three-component seismic data from 
December 2015 to July 2016. The area is shown by the blue rectangle in the inset map. (B) Historical and present 
earthquake occurrences and main structural elements in Kansas. The red highlighted box indicates Sumner 
County, and the yellow circle indicates Wellington Field. Earthquake magnitudes are represented by the area of the 
circles, and color indicates the distribution of historical and present-day seismicity. Most of the earthquakes follow 


the main structural trends. Figure is modified from Hildebrand et al. (53). 


wells 1-28 and 1-32. The northeast and north- 
west trend of the source locations of our events 
was similar to the distribution of regional earth- 
quakes in the USGS catalog and was also shown 
by the open fractures in the wells. 

We observed that the occurrence of the tremor 
signal and local earthquakes had no temporal 
correlation between them, even though their 
spatial trends were along the same northeast 
and northwest directions. No observable in- 
crease in the local background seismicity 
occurred during or after the supercritical CO, 
injection. Most earthquakes in the south- 
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central Kansas area stem from oil and gas 
operations, mainly from the disposal of waste- 
water (28, 29). The lack of increase in con- 
temporary earthquakes rates during the CO, 
injection may indicate that the injection rate 
was not high enough to generate additional 
earthquakes in the region. However, the dis- 
tribution of tremor signals was limited to the 
supercritical CO, injection period, and we did 
not detect these signals in the month before or 
after the injection period. 

The most distinct characteristic of tremor 
signals is the presence of narrow, more or less 
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Fig. 2. Similarity of the spectrogram characteristics between tremors found in this study and those 
found in other geologic environments. A spectrogram is a visual representation that shows how the energy 
in different frequencies changes over time in the data, with the x and y axes representing time and frequency, 
respectively. Here, we show a spectrogram of volcanic tremors (A) reported by Almendros et al. (54) at Arenal 
Volcano in Costa Rica, seismic tremors associated with glacial earthquakes (B) at Whilllans Ice Stream in Antarctica 
described by Pollyea et al. (29), and two examples of the tremor events detected in this study [(C) and (D)]. Note 
the gliding spectral lines in the spectrogram at 1 to 5 Hz. The presence of analogous spectrogram signatures 
within comparable frequency ranges highlights the similarity in the mechanisms generating these events. 
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evenly spaced bands of frequency that gradu- 
ally increase, reach a peak, and then decrease 
(Fig. 2). These time-varying spectral lines are 
also seen in tremor signals from glacial and vol- 
canic regions (30, 31) (Fig. 2). The spectral lines 
are characterized by relatively high amplitude 
and generally stay within the 1- to 5-Hz range. 
The peak frequency often aligns with the peak 
amplitude in the waveform in the time domain. 
Typically, we observed two spectral lines, al- 
though three lines were occasionally visible 
in the spectrogram. 


A physical model to explain tremors 


Mathematical models have been proposed to 
explain the occurrence of harmonic tremors de- 
tected near volcanoes. Dmitrieva et al. (32), using 
the general equations given by Hotovec et al. 
(3D, derived and then showed that a delta func- 
tion with increasing recurrence interval as a 
function of stress accumulation rate could 
generate the characteristic harmonic tremor 
signal seen near volcanoes. Dmitrieva et al.’s 
model shows that the signal is generated by 
shear slip induced by fluid in the volcanic sys- 
tem. In another study, Lipovsky and Dunham 
(30) showed that tremors detected by seismo- 
meters in Whillans Ice Plain, West Antarctica, 
occurred during large-scale sliding events, and 
this was confirmed by GPS stations. These 
tremor events are also characterized by gliding 
spectral lines and can be modeled on the basis 
of repeating earthquake patches at the ice-bed 
interface. Our tremor events bear similarities 
to both volcanic (37) and glacial tremors (30, 33) 
in terms of the gliding spectral lines. The sig- 
nals for both the volcanic and glacial tremors 
are produced by frictional processes. Signals 
showing discernable narrow bands of varying 
frequency with time are related to a varying re- 
currence interval of events, which in turn is 
related to the changing stressing rate a on the 
fault plane (32). We used the same mathemati- 
cal model to show that a pattern of increasing 
followed by decreasing stressing rate o can ex- 
plain the occurrence of the tremor signals within 
the 1 to 5 Hz identified in this study. 
According to Dmietrieva et al. (32), the spec- 
trum of N repeating events occurring periodi- 
cally with a recurrence interval value of T is 
given by multiplying an individual spectrum 
event by a spectrum of sequence of N delta 
functions [as derived in Hotovec et al. (31)]: 


v(o)~ —1 Ms exp (=) exp ( a ana 


© amper 2cQ sin(2) 


where the variable m is the angular frequency, 
v(w) is the Fourier transform of particle veloc- 
ity v(t), Mo is the seismic moment, p is the 
density of the propagating medium, 7 is the 
propagation distance, c is the shear wave speed, 
and Q is the quality factor. 
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Fig. 3. Analysis of the detected events through examination of nearby 

earthquake activities and injection program duration. (A) Source locations 
of tremor events as determined by the beam backprojection method (25, 26). 
Some tremor locations are aligned with the Central Kansas Uplift (northwest- 


southeast) and others with the Humboldt Fault Zone 


Regular earthquakes are aligned with the same structures. (B) Modeling of the 
tremor signal. Shown is an illustration of a series of earthquakes with temporally 
varying recurrence interval times as a result of increasing and decreasing 


Dmitrieva et al. (32) showed that the spec- 
tral peaks occur when w7/2 becomes an inte- 
ger multiple of x. The recurrence interval T is 
dependent upon stress t and stress accumula- 
tion rate a as T ~ dt/o. Considering nearly 
equal stress drop value for events generated 
on a fault plane, an increase in stress accu- 
mulation rate a leads to a decrease in T and 
vice versa. Any smooth change in the recurrence 
interval rate is reflected in the spectrograms of 
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these events as spectral lines increasing or 
decreasing in frequency as a function of time 
(equidistant signals are shown in Fig. 2, C 
and D, and fig. S3). Winberry et al. (33) reported 
the timings of the spectral crests of the observed 
tremors, and these match the highest rate of 
glacial earthquake occurrences. This compari- 
son indicates, as expected, that the occurrences 
of tremor signals with gliding frequencies are 


Jan 2016 


not restricted to volcanic environments alone. 
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rates of stress accumulation. (€) Illustration of multiple spectral lines as a 
result of the series of earthquakes shown in (B). A quadratic rate of change of 
events can simulate the pattern seen in the spectrograms of the observed 
tremor signals. (D) Expanded timeline of the histogram of the tremor events. 
(E) Occurrence of tremor events (black) plotted in the backdrop of earthquakes 
(blue) occurring within 50 km of the injection well. (F) Plot of the maximum 
magnitude earthquakes (green), along with the cumulative number of 


We modified some of the parameter values 
in Dmitrieva et al. (32) for modeling our tremor 
events because the frequency ranges of volcanic 
tremors and our events overlap. In our model- 
ing, we also assumed a linear and quadratic 
change of recurrence interval value 7. We found 
that multiple spectra with different starting and 
ending recurrence intervals generate signals 
that most closely resemble the ones seen in 
Kansas. As observed in the spectrograms, the 
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chief characteristics of the signal, the gliding 
equidistant spectral lines, can be modeled by 
multiple subsequent earthquakes with varying 
interevent times. 


Results and discussion 


Even though earthquakes have increased in 
the northern Oklahoma and southern Kansas 
region overall due to subsurface fluid injection 
(13, 34-42), we found no obvious change of 
background seismicity at the beginning of the 
CO, injection program for oil recovery in the 
Wellington Field region (Fig. 3E and fig. S2). 
We also saw evidence of natural fractures in 
those formations from borehole image logs 
oriented in the same northeast-southwest direc- 
tion as the Humboldt Fault Zone. Previous stud- 
ies showed that alterations in pore pressure 
resulting from fluid injections can trigger both 
seismic and aseismic activity in fault zones 
provided that the local stress conditions, hydro- 
mechanical coupling, and fluid diffusion pro- 
cesses are conducive to such occurrences (43). 
Thus, the tremor signals with gliding spectral 
lines in our study area can provide an expla- 
nation for why faults are releasing some of the 
accumulated stress without causing an increase 
in regular seismicity, i.e., earthquakes. We have 
determined that the characteristics of the wave- 
forms and spectra of the signals can be modeled 
as a series of shear slip events likely induced 
by fluid injection. On the basis of these find- 
ings, we infer that these tremor sources are 
driven by aseismic slip. 

Previous studies have shown that harmonic 
tremors are associated with stress state changes 
caused by the movement of magma and fluids 
in the subsurface (31, 44). Theoretical models 
such as those developed by Thomas and Neuberg 
(45) are valuable for studying source mechanisms 
because they are based on the observations of 
harmonic tremors. Therefore, we derived a 
physical model that is based on Dmitrieva et al. 
(32) and inferred that a temporally varying 
stress pattern can generate the tremor signals 
with gliding spectral lines observed in this area. 
Aseismic movement of faults can occur aided by 
increases in pore fluid pressure near the faults 
caused by fluid injections. Past studies showed 
that frictional properties can change during fluid 
injection from rate weakening to rate strength- 
ening with increasing fluid pressure, which helps 
to maintain aseismic slip (10). A correlation also 
exists among earthquakes, aseismic slip, and the 
pumping of fluids near subsurface faults, as 
shown in areas such as Delaware Basin in west 
Texas (46, 47). We relate the lack of increased 
seismicity in Wellington Field to aseismic fault 
movements that may have driven the localized 
tremor sources found in this area. 

The occurrences of these tremor events have 
implications for the subsurface movement of 
injected fluids. Tracking the movement of sub- 
surface fluids is usually done through simu- 
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lations and modeling (48, 49). Observations 
of these low-frequency tremor events and their 
source locations can help us track deformation, 
fracture location, and movement of injected 
fluids in the subsurface around the injection 
well. Determining the locations of induced 
earthquakes is one of the ways to deduce the 
extent of subsurface fluid migration from the 
injection well (73, 50-52). The injection rate of 
fluids for disposal and enhanced oil recovery 
purposes is monitored carefully to prevent in- 
duced earthquakes from increasing in frequency 
and magnitude. Therefore, observations of such 
tremor events, their source locations, and their 
spatiotemporal distribution may provide a use- 
ful and benign way of tracking the migration of 
injected fluids. 


Conclusions 


We analyzed tremor signals and showed that 
they originated from the subsurface during a 
CO, injection program in Wellington Field in 
Sumner County, Kansas. Using array analyses, 
we ruled out surface noises such as freight 
trains and helicopters as being the origin of 
these tremors. By locating the events using the 
beam backprojection method, we were able to 
show that all of the events occurred in the 
vicinity of the injection well and that they 
followed the major structural trends shown by 
the Humboldt Fault Zone and the Nemaha 
Uplift. This trend was also evident in the spatial 
pattern of earthquakes in Kansas over a long 
time scale. Based on the similarities in their 
waveforms and spectral characteristics, the 
closest physical analog of these signals are 
tremors with gliding spectral lines found in 
volcanic and glacial settings. These tremor sig- 
nals are modeled as being generated by a series 
of small earthquakes with temporally varying 
recurrence interval times. We infer that these 
tremors may be driven by aseismic slow slip. 
Such induced tremor events provide a way to 
track stress distribution and deformation dur- 
ing subsurface high-pressure fluid injection. 
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A stable rhombohedral phase in ferroelectric 
Hf(Zr);,,02 capacitor with ultralow coercive field 


Yuan Wang*“+, Lei Tao*+, Roger Guzman?, Qing Luo’2*, Wu Zhou®, Yang Yang", Yingfen Wei‘, 
Yu Liu”, Pengfei Jiang’, Yuting Chen*, Shuxian Lv", Yaxin Ding’, Wei Wei", Tiancheng Gong’, 


Yan Wang?”, Qi Liu*, Shixuan Du®***, Ming Liu>?4* 


Hafnium oxide-based ferroelectric materials are promising candidates for next-generation nanoscale 
devices because of their ability to integrate into silicon electronics. However, the intrinsic high coercive 
field of the fluorite-structure oxide ferroelectric devices leads to incompatible operating voltage and 
limited endurance performance. We discovered a complementary metal-oxide semiconductor (CMOS)- 
compatible rhombohedral ferroelectric Hf(Zr);,,02 material rich in hafnium-zirconium [Hf(Zr)]. X-ray 
diffraction combined with scanning transmission electron microscopy reveals that the excess Hf(Zr) 
atoms intercalate within the hollow sites. We found that the intercalated atoms expand the lattice and 
increase the in-plane and out-of-plane stresses, which stabilize both the rhombohedral phase (r-phase) 
and its ferroelectric properties. Our ferroelectric devices, which are based on the r-phase Hf(Zr):.,02, 
exhibit an ultralow coercive field (~0.65 megavolts per centimeter). Moreover, we achieved a high 
endurance of more than 10" cycles at saturation polarization. This material discovery may help to realize 


low-cost and long-life memory chips. 


he discovery of ferroelectricity in hafnium 
oxide (HfO,) inspired a renewed interest 
in ferroelectric devices (J-4). Ferroelectric 
HfO, exhibits robust electric dipoles at 
nanometer thicknesses (J, 2, 5) and is 
compatible with modern complementary metal- 
oxide semiconductor (CMOS) technology (6, 7). 
The compatibility with CMOS enables the 
realization of ultradense ferroelectric random- 
access memory (FeRAM), which makes FeRAM 
a competitive candidate for next-generation 
nonvolatile memory devices (4-6, 8-11). Yet, 
the high coercive field (E,) of ferroelectric Hf 


Fig. 1. Planar metal-ferroelectric- A 


metal (MFM) capacitors and 
structural characterization of 
Hf(Zr);,,02 films. (A) Cross- 
sectional STEM-HAADF image of 
the TiN/Hf(Zr);,,02/TiN structure 
with a thickness of 12.1 nm. 
Scale bar, 20 nm. (B) The XPS 
spectrum of Hf 4f in Hf(Zr),,,O2 
film. (©) The XPS spectrum of 

Zr 3d in Hf(Zr);4,O2 film. 

(D) Small signal CV hysteresis 

at 10 kHz (50-mV level) of D 


(Zr)Oz (HZO), leading to a high operating vol- 
tage, is a key issue that inhibits the application 
of HZO-based FeRAM at the state-of-the-art 
technology node (72). Moreover, the applied 
electric field for remnant polarization (P,) 
switching is around 60 to 80% of the breakdown 
strength, suggesting an inherently small mar- 
gin in securing the allowed number of field 
cycles before the breakdown (13). This problem 
is intrinsic to the fluorite-structure oxides, in 
which domain walls do not propagate effici- 
ently and can move only by hopping over a 
large energy barrier (1). 


dat 
electric Hf(Zr)O2 (o-phase HZO), the ery teeseatacteoan 


barrier between the +P, and -P, states can be 
lowered by increasing the stability of the 
intermediate state [the tetragonal phase of 
Hf(Zr)O, (t-phase HZO)]. A stable intermediate 
t-phase HZO will reduce the stability of the 
polarized o-phase and will lead to a lower P, 
and longer wake-up cycling (74-16). However, 
different work-function electrodes can make 
the antiferroelectric hysteresis loop shift in one 
direction, so that the t-phase switches between 
positive (or negative) and nonpolarized states. 
This method can reduce EF, by one-half and 
improve endurance to 10” cycles. Yet, the P, of 
single-sided polarization reversal in this t-phase 
antiferroelectric material is only one-half of 
the o-phase HZO (2P, ~ 16 uC/cm”) (17-19). The 
polarization switching of the r-phase depends 
not only on crystal polarization but also on ex- 
trinsic factors such as defect migration (2), 
which is different from that of the o-phase. 
Because of the high formation energy, the sta- 
bility of stoichiometric r-phase HZO is dependent 
on the stress imparted by the structure of the 
electrode, which is obtained only through 
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epitaxial growth (4). Therefore, a CMOS- 
compatible and stable HZO-based material in 
r-phase can not only fundamentally solve the 
dilemma of hafnia-based ferroelectric optimi- 
zation but can also retain the advantages of 
HZO material. 

We report a ferroelectric r-phase in a Hf(Zr),,,,02 
material and demonstrate a highly CMOS- 
compatible way for experimentally construct- 
ing the r-phase in HfO, materials by increasing 
the atomic ratio of Hf to O. We used x-ray diffrac- 
tion (XRD) and scanning transmission electron 
microscopy (STEM) experiments to confirm the 
existence of the r-phase. Using a high-angle annu- 
lar dark-field (HAADF) image of the Hf(Zr),,,,02 
film, we show the intercalation of excess Hf(Zr) 
atoms, confirming the Hf(Zr)-excess structure. 
Our density functional theory (DFT) calculations 
reveal that the formation energy of the r-phase is 
lower than those of the o-phase and monoclinic 
phase (m-phase) in Hf(Zr)-rich Hf(Zr)O, films 
when the ratio of Hf(Zr) to O is larger than 
1.079:2, which is more conducive to the forma- 
tion of the r-phase. We also conducted device 
experiments, which reveal a high P, value of 
22 uC/em?, a small saturation-polarization elec- 
tric field (,) [1.25 megavolts (MV)/cm], an ultra- 
low E, (~0.65 MV/cm), and a high breakdown 
electric field (Epp) (4.16 MV/cm). Moreover, we 
achieved a long device lifetime of more than 
10” cycles at saturation polarization. 


Preparation and structural characterization of 
r-phase Hf(Zr)1,,.02 


We fabricated and characterized metal- 
ferroelectric-metal (MFM) capacitors with 
Hf(Zr),,,0. films (thickness of 12 nm) to 
confirm their ferroelectricity. Ferroelectric 
Hf(Zr),,,O. layers can be simply obtained by 
using magnetron sputtering from Hf and ZrO, 
targets. We introduced a reactive oxygen source 
during sputtering (fig. S1). We show a cross- 
sectional STEM-HAADF image with the stack- 
ing of TiN/Hf(Zr),,,,02/TIN layers (Fig. 1A). Using 
energy-dispersive x-ray spectroscopy (EDS) 
maps (fig. $2), we confirmed that the TiN/ 
Hf(Zr),,,0.2/TiN device has structurally and 
chemically sharp interfaces without obvious 
interdiffusion. Our x-ray photoelectron spec- 
troscopy (XPS) analysis of Hf(Zr),,,O. films 
confirms that the Hf(Zr) content in the film is 
greater than that of standard stoichiometry. 
When the Hf source power was varied, the 7 
in Hf(Zr),,,0. films increased from 0.129 to 
0.273 (fig. S3A). The depth-profile analysis of 
Hf(Zr),,,O. films shows that the elemental 
proportion of Hf, Zr, and O in the thin layer does 
not change with the thickness (fig. S3, B to D). A 
careful fitting of the spectra (Fig. 1, B and C) 
reveals that the Hf** and Zr** components in 
the film coexist with metallic Hf and Zr°. We 
observed characteristic small signal capacitance- 
voltage (C-V) curves in MFM capacitors (Fig. 1D), 
which is common. The dielectric constant of 
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Fig. 2. Atomic-scale STEM analysis of the rhombohedral phase Hf(Zr)1.,02. (A) Cross-sectional 
STEM-HAADF image of the Hf(Zr);,,02 structure along the [1-21] zone axis in rhombohedral notation. Scale 
bar, 2 nm. (B) (Top) Close-up image of the Hf(Zr),,,O2 structure showing the intercalation of the excess 
Hf(Zr) atoms (highlighted by the white arrows). An atomic model is overlaid to guide the eyes. Scale bar, 
0.5 nm. (Bottom) FFT of the close-up HAADF image. (C) Atomic models of HfO2 with two kinds of 
rhombohedral structures and R3m structure with Hf excess (Hfi..O2). (D) HAADF image simulations along 
the [1-21] zone axis for different structures. The extra contrasts arising from the excess Hf(Zr) atoms are 


highlighted by the white arrows. 


the r-phase Hf(Zr),,..O2 film that we fabricated 
(€, = 33.15) is ~16.5% greater than that of the 
ferroelectric o-phase HZO film (fig. $4). To 
elucidate the ferroelectric properties of TiN/ 
Hf(Zr),,,02/TiN samples, we performed the 
measurements using the positive-up-negative- 
down (PUND) technique (fig. S5). We show a 
typical hysteresis polarization versus electric 
field (P-E) loop of the TiN/Hf(Zr),,,.,0./TiN 
device after 550°C of rapid thermal anneal- 
ing (Fig. 1E), indicating an ultralow E, of 
—0.58 MV/cm and +0.73 MV/cm. We achieved 
a large P, value of 22 uC/ cm? at a small driv- 
ing electric field of 1.25 MV/cm. The e, and 
E,, showed characteristics different from those 
of o-phase ferroelectrics. As a result, we carried 
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out grazing incidence x-ray diffraction (GIXRD) 
on the Hf(Zr),,,0. film (Fig. 1F), which in- 
dicated that the polycrystalline sample con- 
tains both m-phase and r-phase. Typically, 
the reflection peaks of 0(111) and t(011) of HZO 
films appear around 30.5° and 30.8°, respec- 
tively (4, 20). However, the 20 value of this 
Hf(Zr),,,O. film is slightly lower at around 
30.1°. To further confirm the lattice structure 
of the Hf(Zr),,,O02 thin film, TEM analysis of 
this Hf(Zr),,,O. film was necessary. 

We show a high-resolution HAADF-STEM 
image of the Hf(Zr)-rich Hf(Zr),,,,05 film viewed 
along the [1-21] zone axis (unless specified, we 
use rhombohedral notations for crystallography 
indices hereafter) (Fig. 2A). A close-up image 
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Fig. 3. Ferroelectric Hf(Zr);.,02 predicted by DFT calculations. (A) Atomic 
configurations of ferroelectric rhombohedral HfO2 and Hfj,9302. The extra Hf 
atom is marked in blue. The dashed line is the pristine rhombohedral lattice, and 
the solid lines are the hexagonal lattice. The [111] direction of the rhombohedral 
lattice is aligned with the c axis of the hexagonal lattice. (B) Formation energy 
of Hfj.,O02 in r-, o-, and m-phases as a function of x (upper) and Zry4,Oz in r-, t-, 
and m-phases as a function of x (lower). The linear-fitted lines show the 
structural transition from orthorhombic Hfj.,O2 (tetragonal Zr4,02) to rhombo- 
hedral Hf},.02 (Zry4,02) is at 0.039 (0.025), and the transition from monoclinic to 
rhombohedral Hfj..02 (Zr14,02) is at 0.118 (0.079), indicating that the m-r phase 


transition for the binary Hf(Zr);.,O2 occurs when the value of x is larger than 
0.079. (C) Polarization of r-phase HfO2 and Hf,,,O2 as a function of dj. 

(D and E) Energy landscape showing the ferroelectric switching process of HfO2 
and Hf,.og02 with diy of 3.11 A, respectively. (F) Energy differences (AE) between 
centrosymmetric (CS) r-phase Hf,,,02 and ferroelectric (FE) r-phase Hf,,,O2 as 

a function of dj1;. These energy differences represent the estimated ferroelectric 
barriers. (G) Energy landscape across the ferroelectric switching pathway in 
the 14-unit-cell-wide r-phase Hf,.9g02 with diy of 3.11 A as a function of the 
flipped unit cell. (H) Configuration of domain wall separating the 180° polar 
domains according to the black rectangle in (G). 


(Fig. 2B) shows the intercalation of excess 
Hf(Zr) atoms into the structure, forming an 
ordered array within the hollow sites, which is 
in agreement with our Hf-excess model (Fig. 2C). 
Along this crystal projection, the occupation in 
the excess Hf(Zr) columns is about 1/6, thus the 
Z-contrast of the excess Hf(Zr) sites is weaker 
than that of the regular Hf(Zr) columns. Corre- 
sponding fast Fourier transform (FFT) in the 
lower panel of Fig. 2B perfectly matches with 
the simulated electron diffraction patterns of 
Hf-excess R3m[1-21] phase (fig. S6B). The dy 
spacing that we directly measured from FFT is 
3.074 A. Then we compared the experimental 
FFT with those of the other structures in fig. S6C. 
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The o-phase could be excluded, which is charac- 
terized by the extinction of (210) diffraction spots 
marked by the dashed red circles in the simulated 
electron diffraction pattern. To further confirm 
the Hf-excess structure, we performed HAADF 
image simulation of the R3 phase, the R3m phase 
without excess Hf, and the R3m phase with Hf 
excess (Fig. 2D). The R3 phase shows an obvious 
lattice modulation forming Hf-Hf dumbbells, 
which we did not observe in the other two 
structures, nor in our experiments. The im- 
age simulation of R3m structure with Hf ex- 
cess is in agreement with the experimental 
observations. Figure S7 shows the intensity-line 
profile comparison between the experimental 
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and the simulated structures, illustrating the 
contrast modulation within the hollow sites, 
which confirms the ordering of the Hf(Zr) ex- 
cess atoms. 


DFT calculations 

The ferroelectric property of Hf(Zr)Oz is believed to 
originate from the spontaneous centrosymmetry- 
breaking of the r-phase Hf(Zr)O., induced by 
the displacement of Hf(Zr) and corresponding O 
(fig. S8), and is closely related to the applied 
strain in the [111] direction. However, the for- 
mation energy of the r-phase Hf(Zr)O, is much 
higher than those of other phases. By using 


DFT calculations, we calculated the formation 
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energies of the Hf-rich Hf,,,,0. in r-, o-, and 
m-phases, and Zr-rich Zr,,,O, in r-, t-, and 
m-phases with increasing Hf(Zr) ratio (Fig. 3B). 
Using HfO./ZrO, instead of binary HZO to do 
the calculations is acommon strategy, which 
has been widely used to estimate the forma- 
tion energies in two extreme cases (4, 21-23). 
We show the resulting atomic structure of 
stoichiometric r-phase HfO, (Fig. 3A, left) and 
Hf, 90. (Fig. 3A, right) with an extra Hf atom 
(blue balls) at the hollow site bonding to the 
neighboring oxygen atoms in a hexagonal super- 
cell marked by solid lines. The r-phase becomes 
more stable than o-phase Hf,,,0.2 when « is 
>0.039, whereas in Zr}.,Oo, the value of x for r-t 
phase transition is 0.025 (Fig. 3B). When « is 
>0.079 for Zr,,,.O and >0.118 for Hf,,,,O., the 
r-phase Hf,,,00/Zr,,,02 has the lowest forma- 
tion energies, which are even lower than the 
ground state usually reported for the nonferro- 
electric m-phase, resulting in a ground-state fer- 
roelectric Hf,,.,02/Zr},,0.. Therefore, for binary 
HZO, the phase transition from m-phase to 
r-phase of Hf(Zr),,.,02 should occur when the 
value of x is >0.079. We further verified the 
structural stability of Hf, 9,0. by calculating 
the phonon dispersion (fig. S9A). Although the 
excess Hf atoms will introduce extra strain in the 
lattice, the structure can be stabilized by forming 
a new phase with ordering Hf(Zr) excess atoms. 
Therefore, we used Hf,9g02 in the simulation 
that follows. In addition, the formation energies 
of Hf interstitial with oxygen vacancy and anti- 
site defect (fig. S10) indicate that the presence 
of Hf or Zr interstitials is unlikely to cause ad- 
ditional defects, such as oxygen vacancies. As 7 
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Electric Field (MV/cm) 


increases to 0.125, the r-phase Hf, 12,02 remains 
well ordered at room temperature (fig. S9B), 
indicating the robustness of ferroelectricity 
in r-phase Hf,,,,02. By contrast, the structure 
of o-phase Hf;,,0. has almost lost its ferro- 
electricity owing to the random movement of 
the oxygen atoms (fig. S11) when 2 is 0.125. 

The ferroelectric polarization in r-phase HfO, 
is believed to be induced by the lattice expansion 
in the [111] direction (4). The stoichiometric 
r-phase HfO, has a very small negative polari- 
zation if dy is <3.06 A (Fig. 3C). If dy is >3.06 A, 
the value of polarization of HfO. becomes 
positive and keeps increasing. After the inter- 
calation of excess Hf atoms, the value of dy, 
becomes 2.94 A for Hf, 9g02, which is 1% greater 
than that of stoichiometric r-phase HfO, (jy = 
2.91 A). Therefore, the lattice expansion in the 
[111] direction of the r-phase Hf(Zr),,,0. not 
only comes from the electrode but also from the 
intercalation of excess Hf(Zr) atoms. Similarly 
to the stoichiometric r-phase HfOs, the polari- 
zation of Hf,,9g02 depends on the value of dy, 
in which the polarization increases to 15 uC/cm? 
with a dy, of 3.1 A. The existence of Zr can 
stabilize r-phase Hf(Zr),..,0. (24) without chang- 
ing ferroelectric properties substantially (figs. 
$12 and S13). 

Our STEM experiment reveals that the ratio 
between the in-plane (-101) and out-of-plane (111) 
lattice parameters is ~1.14 when the Hf(Zr),,,,0. 
sample is viewed along the [1-21] zone axis (figs. 
S6 and S15). This value is close to that of a struc- 
ture with a dyy of 3.11 A (ratio ~1.124) according to 
DFT calculations (fig. S14), which corresponds to 
a strain of ~’7%. We show the ferroelectric switch- 
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Endurance (Cycles) 


Es/Eb (%) 


ing pathway of 7%-strain Hf, 9302, which has a 
switching barrier of only 7.6 meV per atom (Fig. 3, 
Dand E). Comparing that with stoichiometric 
HfO., which has a barrier of 20 meV per atom, 
we find that the electric field for ferroelectric 
switching is much smaller in r-phase Hf; 9g02 
than that in stoichiometric r-phase HfO,. Al- 
though the switching barriers of both HfO, 
and Hf, 90 increase with dj, (Fig. 3F), the 
switching barrier of r-phase Hf,9g0. is con- 
sistently much lower than that of r-phase HfO, 
at different dy. 

Unlike the ferroelectric HfO, in o-phase with 
localized dipoles, the ferroelectric Hf,,,0. with 
spread dipoles has quite broad domain walls, 
which contribute to a low E,. We show the 
energy landscape across the ferroelectric switch- 
ing pathway in the 14-unit-cell-wide Hf, 9302 
with a dy, of 3.11 A as a function of the flipped 
unit cell (Fig. 3G). The energy increases to 
0.33 eV when two-unit-cell Hf; 9g0z is flipped 
and decreases to 0.29 eV after flipping four- 
unit-cell Hf, 9g0.. The energy variation with 
more flipping unit cells indicates that the width 
of the domain wall in the r-phase Hf,.9gOz is at 
least 1.2 nm (two-unit cell of Hf,9g02) and that 
the required energy for creating a 180° domain 
wall in the Hf, gO. is about 0.22 eV/nm?. 
Figure 3H shows the domain-wall structure 
separating the 180° polar domains according to 
Fig. 3G, in which the polarization is suppressed 
substantially. These domain walls with spread 
dipoles in the r-phase Hf,9gO0. result in a rela- 
tively small ferroelectric switching barrier. We 
show the ferroelectric switching barriers in the 
broad domain wall (fig. S16) according to Fig. 3, 
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Gand H. The barrier for flipping all the dipoles 
in each unit cell independently is 0.28 eV per 
unit cell including 37 atoms (7.6 meV per atom 
in Fig. 3E), whereas it is only 0.12 eV per unit 
cell in the domain wall (fig. S16B). Although 
this energy barrier is much smaller than that 
of the oxygen vacancies’ migration (25), oxygen- 
vacancy migration is still possible during long- 
term electric field operation. 


Electrical characterization of 
rhombohedral Hf(Zr):.,02 


The process window is a key parameter of semi- 
conductor technology (26, 27). The larger the 
parameter is, the more conducive to the control 
of product yield (26). The r-phase Hf(Zr),,,0. 
ferroelectric thin films with the experimen- 
tally measured value of 2 >0.129 have good 
stability. For r-phase Hf(Zr),,,O.2 ferroelectric 
thin films prepared with different Hf power 
(Fig. 4A), although the z in Hf(Zr),,,0. films 
varied from 0.129 to 0.273 (fig. S3A), there 
was almost no change in the P, value and E,, 
with only 2.37 uC/em? and 0.17 MV/cm varia- 
tions of 2P, and 2E,, respectively (fig. S17A). 
We measured the P-E curves of the r-phase 
ferroelectric Hf(Zr),.O, capacitor at different 
electric fields (Fig. 4B). The A2P, and A2E, are 
only 3.2 wC/cm? and 0.01 MV/cm, respective- 
ly (fig. S17B). This indicates that the EF, of the 
r-phase ferroelectric Hf(Zr),,,02 capacitor is 
only 1.25 MV/cm, and thus we can achieve a 
large P, value of 22 uC/cm?. We measured the 
current versus electric field U/-E) curves of 
the r-phase ferroelectric Hf(Zr),,.,0. capacitor 
with different Hf(Zr) compositions and differ- 
ent electric fields (fig. S17, C and D). Figure 4C 
summarizes the P, and E, of ferroelectric 
HfO.,-based films from the literature. The r- 
phase ferroelectric Hf(Zr),,.,02 capacitor shows 
the minimum £,. We also measured the P-E 
curves of r-phase Hf(Zr),9 0. films with differ- 
ent thicknesses (fig. S18). We only found a slight 
change in P, value and EF, between 8.8- and 
12.1-nm Hf(Zr); 290. films. The P, value of the 
6.4-nm film decreases to 14 1C/cem?, and its E, 
(0.9 MV/cm) is slightly larger than those of 
other thicker films, indicating that the coer- 
cive field increases with decreasing film thick- 
ness (28). Moreover, we obtained more than 
10 years of data retention at 85°C (fig. S19). The 
device-to-device variation of r-phase Hf(Zr),,,.Oo.- 
based ferroelectric capacitors has been studied 
by using the statistical distribution of P-E curves 
(fig. S20A). The distribution of 2P, and 2E, re- 
mains in a narrow range (fig. S20B), indicat- 
ing small device-to-device variation. We show 
reproducibility data for r-phase Hf(Zr); o902 
ferroelectric capacitors in four manufacturing 
batches (fig. S21). Considering manufacturing 
process errors, we believe that the slight differ- 
ences in P, values and EF, between Hf(Zr); 990. 
ferroelectric capacitors from different manu- 
facturing batches are acceptable. 
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The limited number of field cycles of the o- 
phase ferroelectric HfO.-based devices is a 
natural consequence of such films’ high EF, 
relative to their breakdown field strength (13). 
Although reducing the operating electric field 
can prolong the device lifetime, incomplete 
polarization occurs, resulting in the reduction 
of the P, value. This phenomenon is called the 
“P.-endurance dilemma” (13). Therefore, the 
key to solve this problem is to reduce the E, 
and EF, of HfO.-based ferroelectric films. Com- 
pared with o-phase ferroelectric HfO.-based 
films, r-phase ferroelectric Hf(Zr),,,0.-based 
films have a smaller E, and E, as a result of 
their lower kinetic-energy barrier between the 
two spontaneous polarization states. The fer- 
roelectric capacitors achieve saturation polar- 
ization at an ultralow electric field (1.25 MV/cm) 
because of their smaller coercive field, E,. An- 
other important factor affecting the reliability 
of ferroelectric HfO2-based films is leakage cur- 
rent and Epp. We plot the /-E curves of o-phase 
HZO ferroelectric and r-phase Hf(Zr),,,0. 
ferroelectric devices (Fig. 4D). The peak high- 
lighted with an orange ellipse in the de current 
voltage U-V) curve shows a negative differen- 
tial resistance-type behavior, suggesting that 
this peak is a result of a combination of polar- 
ization switching and ion-migration phenomena 
(2, 29). Compared with the o-phase ferroelectric 
HfO, cell, the leakage current of the r-phase 
Hf(Zr),,,0. device is reduced by an order of 
magnitude at 3 V dc voltage, and its Epp is in- 
creased by 25%. Moreover, the E,/Egp is about 
30%, which is only one-half that of the o-phase 
HfO,-based ferroelectric device. Therefore, an 
r-phase Hf(Zr);,.,0. ferroelectric cell can ob- 
tain 10” field cycles at 1.25 MV/cm (Fig. 4E). 
We also obtained the endurance performances 
at different electric fields (fig. S22). These plots 
indicate great optimization potential for the 
endurance of these devices (30). When com- 
pared with the cycles to breakdown at satura- 
tion reported in the literature (Fig. 4F), the 
cycles of our r-phase ferroelectric Hf(Zr),,,O.- 
based films have been improved by about two 
orders of magnitude. 


Conclusions 


We reported a r-phase ferroelectric Hf(Zr),,.,0. 
thin film. The proposed ground-state r-phase 
ferroelectric Hf(Zr),,,O. film solves the prob- 
lem of high F, inherent in o-phase ferroelectric 
HfO,. Furthermore, these r-phase Hf(Zr),,.,0. 
ferroelectric thin films are CMOS-compatible. 
HAADF-STEM imaging of the Hf(Zr);,,0. 
film shows the intercalation of excess Hf(Zr) 
atoms, confirming the Hf(Zr)-excess structure. 
The excess Hf(Zr) atoms expand the lattice of 
Hf(Zr);,.02 and produce greater in-plane and 
out-of-plane stress, which is more conducive 
to stabilizing ferroelectric properties. An ultra- 
low coercive field is obtained owing to Hf em- 
bedding in HfO., which decreases the r-phase 
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switching barrier and the spread of dipole- 
induced broad domain walls. High Egp, large 
P,, and good endurance are achieved in the 
proposed ferroelectric Hf(Zr),,,0. capacitor 
simultaneously. Consequently, the discovery 
of the r-phase Hf(Zr),,,0. ferroelectric films 
may contribute to realizing low-cost and long- 
life memory chips. 
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Pervasive within-species spatial repulsion among 


adult tropical trees 


Michael Kalyuzhny™, Jeffrey K. Lake”, S. Joseph Wright®, Annette M. Ostling"* 


For species to coexist, performance must decline as the density of conspecific individuals increases. 
Although evidence for such conspecific negative density dependence (CNDD) exists in forests, the 
within-species spatial repulsion it should produce has rarely been demonstrated in adults. In this study, 
we show that in comparison to a null model of stochastic birth, death, and limited dispersal, the adults 
of dozens of tropical forest tree species show strong spatial repulsion, some to surprising distances of 
approximately 100 meters. We used simulations to show that such strong repulsion can only occur 

if CNDD considerably exceeds heterospecific negative density dependence—an even stronger 

condition required for coexistence—and that large-scale repulsion can indeed result from small-scale 
CNDD. These results demonstrate substantial niche differences between species that may 


stabilize species diversity. 


ow hundreds of species coexist on small 
spatial scales remains a question. Some 
tropical forests contain more than 250 
species of trees in a single hectare, or more 
than 1100 species of trees and shrubs 
in 25 ha (1). Coexistence theory posits that 
species can coexist if the following three con- 
ditions are met: First, species should be suf- 
ficiently similar in their competitive ability 
(2, 3). Second, the performance of species should 
decrease as they become more common, pre- 
venting any one species from taking over. This 
phenomenon is known as conspecific nega- 
tive density dependence (CNDD) (2, 4). Third, 
CNDD must be stronger than heterospecific 
negative density dependence (HNDD) (2, 4), 
implying that species mostly “suffer” from com- 
petition with their own kind because of niche 
differences between species that stabilize pop- 
ulation size. Such CNDD can be caused by com- 
petition for specialized resources (2, 5), but in 
tropical trees, species-specific natural enemies— 
herbivores and pathogens, which attack seeds 
and juveniles—are believed to be a major driver 
of CNDD (5-8) and diversity maintenance. 
Considerable evidence has been presented 
for CNDD acting on early life stages in tropi- 
cal and temperate forests, and variation in 
CNDD has been hypothesized to underlie the 
latitudinal gradient from high tree-species diver- 
sity at the equator to lower diversity at higher 
latitudes (2, 6, 9). For example, the abundance 
of seedlings, seeds, and adults has been shown 
by numerous works to reduce the number and 
survival of conspecific seedlings and saplings 
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(2, 10-13), and several works have also presented 
evidence that such CNDD is stronger than 
HNDD (10, 12). Nevertheless, the validity of 
many of these findings has been questioned 
on methodological grounds (4, 17). Moreover, 
adult tropical trees can live for hundreds of 
years (J4) and integrate density-dependent 
effects throughout their life. CNDD at one life 
stage can be offset by positive effects at an- 
other (4, 75) or have a weak effect on lifetime 
performance. This raises the question of the 
overall integrated effect of conspecifics on 
populations, which determines long-term co- 
existence (4). 

A potential approach to evaluating the inte- 
grated effect of CNDD across life stages is 
through the study of the spatial distribution of 
adult trees. Because CNDD has been shown to 
operate at short spatial scales around adults 
[~20 m (2, 10, 13)], it is predicted to generate 
spatial “repulsion” between adults, potentially 
leading to spatial overdispersion (16, 17). How- 
ever, classical studies of the within-species 
spatial distributions of adult trees found them 
to be almost ubiquitously aggregated in tropical 
forests (18-20). Hence, individuals are typically 
closer to each other than expected, especially 
at small scales at which CNDD has been shown 
to be the strongest (78-20). Furthermore, many 
spatial patterns in tropical forests resemble 
the expectations of neutral models (21-23), 
incorporating only dispersal limitation and 
stochastic demography, or reflect habitat het- 
erogeneity (24-26) with little evidence of CNDD 
(27). These findings suggest that CNDD observed 
at early life stages is severely diluted, over- 
whelmed, or obscured before trees reach adult- 
hood by processes that create aggregation, 
such as habitat specificity (28), facilitation (29), 
and dispersal limitation (17, 30), calling the 
importance of CNDD for diversity maintenance 
into question. 

Classical studies on the spatial patterns of 
adult trees compared their distributions with 
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(16, 18, 19). This model distributes individuee 


with equal probability across the (often arbi- 
trary) survey area, inevitably assigning some 
to sites that they would not be able to reach 
because of dispersal limitation (Fig. 1, A and B). 
We believe that a more ecologically sound null 
model is Dispersal Limitation [spatially explicit 
neutral theory (21, 31], which assumes that 
abundance and distribution are determined by 
stochastic births and deaths and random yet 
limited dispersal. The use of such a null model 
should incorporate the aggregation created by 
dispersal limitation into the null expectations 
(Fig. 1C and figs. S1 to S3), potentially reveal- 
ing the signals of biotic interactions and habitat 
specificity that have shaped the distributions of 
adults through their lives. 

In this study, we compared the within- 
species spatial patterns of adult tropical trees 
in a Panamanian forest with their expectation 
under the Dispersal Limitation null model by 
using previously estimated species-specific 
dispersal-distance distributions (32). Our goal 
was to examine the prevalence, magnitude, and 
scale of overdispersion and aggregation with 
respect to Dispersal Limitation because these 
patterns reflect the overall integrated effect of 
density dependence on individuals through 
their ontogeny. Because spatial repulsion is the 
signature of CNDD (6, 28)—which is caused by 
local interactions and has been shown to oper- 
ate on small spatial scales (~20 m) (2, 10, 13)— 
we hypothesized that some species for which 
the effect of CNDD propagates to shape adult 
distributions would show overdispersion at 
small scales. We then used a spatially explicit 
simulation model to examine whether the 
within-species spatial patterns we found can 
indicate whether CNDD is stronger than HNDD 
across life stages. 


Quantifying spatial patterns 


We performed the analysis with 41 species 
that had reliable estimates of dispersal distance 
distributions, which were obtained using known 
locations of seed-bearing trees and seed arrivals 
(32) in the Barro Colorado Island 50-ha forest 
plot (33). These estimates have been corrobo- 
rated by using genotyped seeds and adults (34), 
highlighting their reliability. We ran Dispersal 
Limitation simulations for every species sepa- 
rately, parameterized the simulations with the 
dispersal distance distributions, and compared 
the observed spatial distribution of adults of 
each species in 2015 with the expectation under 
Dispersal Limitation. We also compared ob- 
served spatial patterns with a Random null to 
highlight the differences in insight that emerge. 
For our comparisons, we developed analogs of 
the two most used point-pattern statistics: the 
Clark-Evans nearest-neighbor statistic and the 
relative neighborhood density (i.e., pair correla- 
tion function) (16, 18, 19) statistic. We compareed 
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the observed mean [or median; (35)] distance 
of trees to their nearest conspecific neighbor, 
NNDobps, to its value under the null (VND) 
using the excess distance (ED) statistic: 


ED = log(NNDpps) — log(NNDyun) (1) 


Fig. 1. Spatial point patterns 

of adult Beilschmiedia pendula 
trees. (A) The observed pattern 
appears aggregated compared with 
(B) realizations of the Random null model 
but overdispersed compared with 
(C) the standard variant of the Dispersal 
Limitation null model. Both nulls use 
the same abundance as the observed. 
The mean dispersal distance of 
B. pendula is estimated at 10.7 m (32). 
Figure S1 presents a visualization 

of the processes in the Dispersal 
Limitation null model and its variants, 
and fig. S2 presents realizations of 

the other variants of Dispersal 
Limitation, whereas fig. S3 zooms 

in on the clump of trees at the 

top left of (C). 


Y (m) 


where the averaging (denoted by an overbar in 
equations 1 and 2) is over samples of the Dis- 
persal Limitation or Random null models. De- 
tails on how we calculated ED can be found in 
the supplementary materials (35). We also 
computed the Excess neighborhood Abundance 
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[EA(r)], namely the observed number of con- 
specific neighbors of an average tree at dis- 
tances r to r + Ar, Nops(7), relative to its value 
under the null Naun(”) 

EA(r) = log(Nops(7)) = log(Naun(7)) (2) 
To examine the robustness of our results, we 
repeated this analysis with the standard Dis- 
persal Limitation null model and with five 
additional variants that make different assump- 
tions about the demography and dispersal of 
species (35), including alternative dispersal 
kernels, a time lag from dispersal to recruitment, 
and demographic trends [variant descriptions 
and illustrations (figs. S1 to S3) can be found in 
(35)]. We also argue and show that our anal- 
ysis is qualitatively robust to additional factors 
and processes that affect spatial distribution 
patterns and the ability to evaluate them (figs. 
S4 to S8) (35), including dioecy (fig. S4), clus- 
tered dispersal that may result from animals 
depositing clusters of seeds (fig. S5) (36), and 
edge effects (fig. S6). Furthermore, we demon- 
strated statistical power to detect repulsion 
even for rare species (fig. $7). 


Strong and widespread repulsion in 
adult trees 


The tree species that we studied are typically 
aggregated and never significantly overdispersed 
compared with the Random null model (Table 1), 
which is in line with previous findings (/8, 19). 
However, the same species are almost ubiqui- 
tously and strongly overdispersed on short 
scales (~20 m) when compared with Dispersal 
Limitation (Table 1). The results are qualita- 
tively similar in all six variants of the Dispersal 
Limitation null model (table S1), with trees 
having 50 to 80% fewer conspecific neighbors 


Table 1. Most species are overdispersed compared with Dispersal Limitation but are aggregated compared with a Random null model. Distributions 
and significance of the spatial statistics of Barro Colorado tree species in 2015 were calculated with either the standard version of the Dispersal Limitation 
model or the Random model as reference. We compared the abundance of conspecific neighbors within 20 m (left; Eq. 2) and the nearest-neighbor distance 
(right; Eq. 1) to expected values from null models. For abundances (left), negative values represent lower neighbor density than expected, indicating repulsion 
and vice versa. For distances (right), positive values represent greater distance than expected, indicating repulsion. Exponentiating the statistics would give 
the factor by which the observed density or distance exceeds expectations. The mean + standard deviation (SD) and the range (in square brackets) of 

the statistics over all species are presented, as well as the number of overdispersed and aggregated species and the number of statistically significant (with 
a = 0.05) species in parentheses. Excess Abundance, EA; Excess nearest-neighbor Distance, ED. 


Null Values + SD 
[range] 

: He -15+12 
Dispersal Limitation [-3.9, 13] 
Rancor 0.72 + 0.66 

FOS, 25] 


EFA atr=20m ED 

Overdispersed Aggregated Values + SD Overdispersed Aggregated 

(significant) (significant) [range] (significant) (significant) 
0.9 + 0.61 

ee ee eee nk NEN co” eee nen ae 
-0.2 + 0.19 

6 (0) 35 (24) [-0.77, 0.095] 5 (0) 36 (24) 
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B Dispersal Limitation null 
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Excess Abundance [EA(n)] 
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Fig. 2. Spatial distributions of four typical species and their EA(r). Calculation of EA(r) can be found in Eq. 2. Each row presents a different species, whereas 
the columns present for each species the observed distribution [(A), (D), (G), and (J)] versus a single realization of the standard Dispersal Limitation null 

[(B), (E), (H), and (K)]. EA(r), calculated by using 10 m distance bins, is presented in the shaded column [(C), (F), (I), and (L)], with the orange region indicating the 
95% global envelope (48) of the Dispersal Limitation null. EA® (r) of 0 indicates agreement with Dispersal Limitation, whereas values <O indicate overdispersion. 
All distribution maps are 1000 by 500 m. Compared with Dispersal Limitation, trees have considerably fewer neighbors than expected up to ~100 m. 


than expected within 20 m, and their nearest 
conspecific neighbor being 1.8 to 2.5 times more 
distant than expected (these numbers are the 
values of the mean statistics in Table 1 and 
table S1, exponentiated). Figure 2 shows typi- 
cal, monotonically increasing EA(r) curves for 
four species, along with their distribution 
maps in comparison with the expectations of 
the Dispersal Limitation null model, whereas 
figs. S9 and S10 show EA(r) for all species 
analyzed that had more than 24 adults in the 
plot. Unlike previous findings of positive con- 
specific effects in shrubs and no CNDD in tree- 
lets (13), as well as little CNDD for common 
species (JO), even the most common treelets 
(Fig. 2, A to C), shrubs (Fig. 2, D to F), and trees 
(Fig. 2, G to I) are overdispersed, as are rela- 
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tively rare species (data S1). Not surprisingly, 
some light-demanding and _ habitat-specialist 
species deviate from this typical pattern, ex- 
hibiting increased aggregation reflective of 
the limited spatial extent of the conditions they 
require (Fig. 3). 


Strong repulsion is caused mostly by 
conspecific effects 


Theoretical studies show that overdispersion 
is the signature of negative density depen- 
dence (16, 17), leading us to infer that the over- 
dispersion that we detected reflects the spatial 
signature of CNDD acting on juveniles—for 
which much evidence on Barro Colorado Island 
exists (10-13)—propagating to affect the adults. 
With a simulation model (fig. S11) (35), we 
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further show that the observed overdispersion 
is only consistent with a scenario when CNDD » 
HNDD. Figure 4 and figure S12 show the degree 
of overdispersion generated in spatially explicit 
stochastic simulations of community dynamics 
in which CNDD and HNDD both reduce the 
survival of dispersing juveniles and are set to 
be strong out to a distance of about 7 m (fig. S11). 
First, when the magnitude of CNDD equals 
that of HNDD (Fig. 4, along the diagonal), 
overdispersion in the simulations quickly 
asymptotes with the magnitude of density de- 
pendence at values indicating much lower over- 
dispersion than observed on Barro Colorado 
Island. Although EA(20) is ~-1.5 in Barro 
Colorado (Table 1), the simulated EA(20) 
asymptotes at much lower values (~-0.18 
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Z. ekmanii 
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C. billbergianus 


J. copaia 


Fig. 3. Spatial distributions of four atypical species and their EA(r). 
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associations with the large high-plateau and slope habitats (49), which 


Calculation of EA(r) can be found in Eq. 2. Each row presents a different species, 
whereas the columns present for each species the observed distribution (left) 
versus a single realization of the standard Dispersal Limitation null (center). 
EA(r), calculated by using 10 m distance bins, is presented in the shaded column 
(right), with the orange region indicating the 95% global envelope (48) of the 
Dispersal Limitation null. EA(r) of O corresponds to the expectation of Dispersal 
Limitation, whereas values <O indicate overdispersion. All distribution maps are 
1000 by 500 m. (A to C) Coussarea curvigemmia is a habitat specialist with a 
strong positive affiliation with the large low-plateau habitat and negative 


leads to aggregation up to large distances. The three other species, 

(D to F) Zanthoxylum ekmanii, (G to I) Croton billbergianus, and (J to L) 
Jacaranda copaia, are fast-growing gap specialists (10). They all have increased 
aggregation at short scales (0 to 30 m), typical of gaps, compared with 
intermediate scales. Z. ekmanii is also estimated to have a very large dispersal 
distance (105 m) (32), which leads to a relatively random spatial distribution 
under Dispersal Limitation. Compared with this expected distribution, the 
observed clumps of trees lead to the high positive values of EA(r) at small 
scales for this species. 


or ~-0.55) (Fig. 4 and figs. S13 and S14). 
Intuitively, when HNDD equals CNDD, the 
competitive effect of neighbors is equal regard- 
less of their species, and so density dependence 
cannot generate a strong repulsion from con- 
specifics. Second, only the scenario CNDD » 
HNDD (Fig. 4, upper left corners) is compat- 
ible with the magnitude of overdispersion ob- 
served on Barro Colorado. Last, our results 
are similar for the “fixed” Dispersal Limitation 
variant (table S1)—in which the locations of all 
adults are preserved—and the standard dis- 
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persal limitation variant. The fixed variant 
preserves the spacing between all trees of all 
species, which result from competition for light, 
soil resources, and other non-species-specific 
factors, hence incorporating HNDD and some 
CNDD into the null expectation. The strong 
overdispersion compared with this variant fur- 
ther confirms the weakness of HNDD com- 
pared with CNDD. 

Although we expected overdispersion to occur 
primarily at the short distances at which CNDD 
operates (~20 m) (2, 10, 13), we found that 
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several species are significantly and substan- 
tially overdispersed at distances up to ~100 m, 
with all species having on average ~ 45% fewer 
than expected conspecific neighbors 75 to 125 m 
away from a typical tree (under the standard 
Dispersal Limitation null and most other model 
variants) (Fig. 2 and table S2). Our simulations 
show that such long-distance overdispersion can 
result from short-distance CNDD (acting to 7 m) 
(fig. $15). In hindsight, this result appeals to 
intuition. Although each tree “repels” only its 
neighbors, they in turn repel their neighbors 
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Fig. 4. Strong overdispersion is only possible when CNDD >> HNDD. We present the Excess neighborhood 
Abundance at r = 20 m, EA(20) (Eq. 2), in simulations. In each simulation, the magnitude of CNDD 

(Qc, y axis) and HNDD (Qj, x axis) were controlled, and the color of each square represents the mean EA(20) 
of simulated populations. In the simulations, seeds are dispersed an average distance of (A) D = 7 or 

(B) 20 m, and the survival of each is determined by both the density of conspecific adults and the density 
of heterospecific adults in a way that gives large weight to adults within ~7 m of where the seed lands. For 
perspective, a seed landing right beside a single conspecific would have a recruitment probability of 

1/(1 + Qc), and more nearby conspecifics and heterospecifics would reduce this probability further (35). 
Results are not shown for cases in which HNDD > CNDD (below the diagonal) because priority effects lead to 
one species taking over these communities. When CNDD = HNDD (on the diagonal), overdispersion is weak, 
and strong levels of overdispersion, comparable with those observed for trees on Barro Colorado Island, 


only occur when CNDD >> HNDD. 


and so forth, creating large-scale patterns from 
short-distance interactions. 


Overview and implications for species 
diversity maintenance 


Several previous analyses have revealed spa- 
tial signatures of density dependence (30, 37), 
but most of these studies do not focus on adults, 
therefore leaving open the question of whether 
the effects propagate to the adult stage, a prop- 
agation that is needed for coexistence (4, 27). 
Focusing on larger trees and community-level 
spatial patterns, May et al. (21) found that all 
trees =10 cm diameter at breast height are 
overdispersed compared with a Random null 
model at distances <10 m, which indicates 
general density dependence among all trees 
(including conspecifics and heterospecifics) 
of all species. Several studies also found that 
conspecific aggregation decreases with tree 
size (18, 20, 38), but those that considered 
more than a handful of species found signif- 
icant reductions in only 15 to 22% of species 
studied (18, 20). By contrast, we found that 
most species are overdispersed at the adult 
stage. This difference might occur because the 
strongest effects of CNDD occur at early life 
stages (39) before the sapling stage consid- 
ered by (8, 20). This highlights the utility of 
comparison with the Dispersal Limitation 
null model, which uses seed rain as the refer- 
ence. No previous study of spatial patterns 
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of which we are aware evaluated the relative 
strength of CNDD and HNDD, whereas we 
show that species-specific spatial patterns of 
adults are only consistent with a scenario when 
CNDD » HNDD across life stages. 

These findings have implications for the 
maintenance of species diversity. First, our 
finding that CNDD is substantially greater than 
HNDD for many species implies large niche 
differences among species, which may stabi- 
lize diversity because species that increase in 
abundance suffer reduced fitness. However, 
our results do not establish that species in 
Barro Colorado Island do indeed stably coexist 
because of CNDD: It has recently been debated 
whether differences in species-level competitive 
ability can overwhelm the stabilizing effects 
of CNDD, especially in species-rich communi- 
ties (40, 41). Moreover, the ability of CNDD to 
maintain diversity may further be hindered by 
differences between species in susceptibility 
to CNDD (42) and by the local aggregation 
of rare species (43). Hence, the potential of 
CNDD to maintain species diversity needs fur- 
ther investigation. 

Another implication for diversity mainte- 
nance concerns f diversity. Given that the 
forest is saturated with individuals (31), the 
finding that individuals of most species have 
fewer conspecific neighbors up to long dis- 
tances implies an increase in heterospecific 
neighbors and an increase in compositional 
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turnover at these distances. This is consis- 
tent with the reported increased spatial turn- 
over (f diversity) at scales of 20 to 200 m in 
Barro Colorado compared with a neutral model 
(22). B diversity has been extensively studied 
and is believed to be driven mostly by spatial 
heterogeneity, dispersal limitation, and inter- 
specific competition (22, 44). Our results sug- 
gest that the role of CNDD in shaping turnover 
should also be carefully considered. 

More broadly, patterns of species distribu- 
tions at local scales have been investigated 
and compared to Random null models for more 
than 100 years (45), often with the aim of de- 
tecting the signals of habitat specificity and 
biotic interactions (18, 19, 26, 46). However, 
theoretical models predict that dispersal limi- 
tation alone could override and obscure the 
signal of biotic interactions and dominate 
spatial patterns, creating ubiquitous aggregated 
distributions (17). Indeed, spatial patterns have 
been found to be ubiquitously aggregated with 
respect to the Random null (78, 19, 46). By con- 
trast, a richness of spatial patterns is revealed 
with respect to the Dispersal Limitation null 
model: Most species are overdispersed, owing 
to CNDD, whereas a few habitat and gap 
specialists are more aggregated (Fig. 3). This 
demonstrates the power of the Dispersal Limi- 
tation null to filter out much of the effect of 
dispersal limitation and to reveal the overall 
effect of biotic interactions and habitat speci- 
ficity on the spatial distributions of adult 
individuals. Our work thus promotes a com- 
plementary definition of the classical terms 
“aggregation” and “overdispersion,” which 
are rooted in ecological processes and theory. 
We believe that a more widespread use of this 
approach—using data on spatial distributions 
and seed dispersal from forests plot around 
the globe (9)—will have much capacity to shed 
light on the processes that shape the assembly 
of ecological communities and on the differ- 
ences between them. 
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CYTOKINE SIGNALING 


Structural basis of y chain family receptor sharing 


at the membrane level 


Tiantian Cai’++, Rachel Lenoir Capello", Xiong Pi"?, Hao Wu*”, James J. Chou?*§ 


Common y chain (yc) cytokine receptors, including interleukin-2 (IL-2), IL-4, IL-7, IL-9, IL-15, and IL-21 receptors, 
are activated upon engagement with a common yc receptor (CD132) by concomitant binding of their 
ectodomains to an interleukin. In this work, we find that direct interactions between the transmembrane 
domains (TMDs) of both the yc and the interleukin receptors (ILRs) are also required for receptor activation. 
Moreover, the same yc TMD can specifically recognize multiple ILR TMDs of diverse sequences within the 
family. Heterodimer structures of yc TMD bound to IL-7 and IL-9 receptor TMDs—determined in a lipid 
bilayer-like environment by nuclear magnetic resonance spectroscopy—reveal a conserved knob-into-hole 
mechanism of recognition that mediates receptor sharing within the membrane. Thus, signaling in the yc 
receptor family requires specific heterotypic interactions of the TMDs. 


embers of the common y chain (yc) 
family of cytokine receptors, upon 
binding to their corresponding inter- 
leukin, pair specifically with a com- 
mon, shared yc receptor (CD132) to 
allow signal transduction (/, 2). To date, the 
family consists of six members: the interleukin-2 
receptor (IL-2R), IL-4R, IL-7R, IL-9R, IL-15R, 
and IL-21R. These receptors play crucial roles in 
the development and proliferation of multiple 
lymphocyte lineages of both the innate and 
adaptive immune systems (J, 3) and are clinically 
important targets for immunotherapy (2, 4, 5). 
Interleukin receptors (ILRs) and the yc are type 
I transmembrane proteins comprising a ligand- 
binding ectodomain, a transmembrane domain 
(TMD), and an intracellular domain (ICD) con- 
taining a Janus kinase (JAK)-binding site. 

The mechanism by which interleukins me- 
diate specific pairing of the yc with ILRs has 
been revealed by crystallographic studies of the 
IL-2-bound and IL-4-bound receptor com- 
plexes (6-8). In both cases, the interleukin 
binding to an ILR generates a composite sur- 
face for subsequent interaction with the yc. 
This interaction interface is small (~1000 A”) 
and characterized by relatively flat surface com- 
plementarity (7), constituting the structural 
basis for the degenerate cytokine recognition 
by the yc. 

One aspect of ILRs not well understood is 
the TMD and its possible role in receptor sig- 
naling. This question was inspired by previous 
studies on other receptors that have revealed 
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much more active roles of TMDs in mediating 
receptor oligomerization and activation than 
conventionally appreciated. Examples include 
type 1 cytokine receptors, such as erythropoi- 
etin receptor (9), thrombopoietin receptor (10), 
growth hormone receptor (J/, 12), receptors in 
the tumor necrosis factor receptor superfamily 
(13, 14), and receptor tyrosine kinases (15, 16). 
An early cryo-electron microscopy (cryo-EM) 
study of a full-length transmembrane form of 
a quaternary cytokine-receptor complex com- 
prising gp130, LIF-R, the cytokine ciliary neuro- 
trophic factor (CNTF), and its alpha receptor 
(CNTF-Ra) has suggested that the TMDs of these 
receptor chains are associated (17). Furthermore, 
mutations within TMDs of the yc family recep- 
tors have been recorded in clinical studies. For 
example, a single mutation (V253G) in the IL-7R 
TMD is associated with acute lymphoblastic 
leukemias (ALLs) (8). The structural basis of 
these disease mutations is unknown owing to 
a lack of structural information for the trans- 
membrane region of the yc family receptors. 

In this study, we report that heterotypic in- 
teractions between the TMDs of the yc and its 
family members are an essential component of 
receptor signaling, which raises the question 
how a single transmembrane helix (TMH) can 
have the structural sophistication to specifi- 
cally recognize a variety of TMHs with highly 
divergent sequences. We determined high- 
resolution structures of the yc TMD in com- 
plex with IL-7R and IL-9R TMDs in bicelles 
that mimic a lipid bilayer. The two heterodimer 
structures and functional mutagenesis reveal 
a common knob-into-hole mechanism that 
underlies degenerate ILR pairing with the 
ye within the membrane. 


Removal of an ILR (IL-7R or IL-9R) and 
yc ectodomains activates JAK-STAT signaling 


A large body of structural and functional studies 
on members of the yc family receptors have 
described receptor activation (4, 6-8, 19). 
Briefly, interleukins bind to the ectodomains 


® 


positioning the ICDs in the correct arrange- 
ment to allow reciprocal phosphorylation of 
JAK1 and JAK3 and to activate downstream 
signal transducers and activators of transcrip- 
tion (STAT) signaling (Fig. 1A). However, the 
ectodomain and ICD are separated by the 
TMD, which may also contribute to driving a 
signaling-compatible configuration. The pre- 
ligand conformation of receptor ectodomains 
could physically prevent ligand-independent, 
TMD-driven signaling (74, 20). To test whether 
heterotypic association between the ILR TMD 
and the yc TMD can be achieved by the remo- 
val of ectodomains, wild-type (WT) or cleav- 
able human IL-7R (hIL-7R) and yc were 
designed (Fig. 1B) and coexpressed in BaF3 
cells. After incubation with TEV protease to 
shed the ectodomains, activation was eval- 
uated with STATS phosphorylation. 

Both WT receptors (hIL-7R and hyc) and 
cleavable receptors (TEV-hIL-7R and TEV-hyc) 
were well expressed on the cell membrane with 
enhanced green fluorescent protein (EGFP) 
and mCherry fused to the C termini of hIL-7R 
and hyc, respectively (fig. S1A). Colocalization 
of hIL-7R and hyc as well as sparse bright 
puncta were observed in the absence of hIL-7, 
consistent with previous accounts of both 
homotypic preassociation of hIL-7R (27) and 
heterotypic preassociation with hyc (22, 23). 
More puncta appeared on the cell surface after 
ligand incubation as a result of greater recep- 
tor clustering induced by ligand binding. Pre- 
dictably, this was accompanied by receptor 
activation as indicated by STAT5 phosphoryl- 
ation (fig. S1B). The TEV-cleavable hIL-7R and 
hyc exhibited analogous ligand response to 
that of the WT receptors (fig. SIB) but could be 
activated by incubating BaF3 cells with TEV 
protease in the absence of ligand (Fig. 1C). The 
TEV protease dose-response profile revealed 
that ~40% maximal activation could be reached 
compared with IL-7-induced activation (Fig. 
1D) and that the increase of receptor activa- 
tion correlated with increased shedding of 
receptor ectodomains from the cell surface 
(fig. S1, C to H). More cell-surface puncta were 
observed after treatment with 50 ug/ml of TEV 
protease, which suggests the ability of TMH- 
ICD to cluster, though substantially less than 
in the context of ectodomain-ligand engage- 
ment (fig. SID). By contrast, no activation was 
detected after adding TEV protease to cells 
expressing WT receptors (Fig. 1, C to D, and fig. 
S1D. Similar receptor activity induced by pro- 
teolytic removal of ectodomain was observed 
for cleavable hIL-9R and hyc (fig. $2), which 
suggests that heterotypic interaction of TMDs 
is a general phenomenon for yc receptor fam- 
ily members. 

To independently examine the function- 
al consequence of ectodomain removal, we 
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designed two constructs with the ectodo- 
main (ECD) deleted (hIL-7R-AECD and hyc- 
AECD) and expressed each of them alone or 
paired with a WT or ectodomain-deleted 


receptor (Fig. 1E). The membrane integra- 
tions of hIL-7R-AECD and hyc-AECD were 
confirmed by confocal microscopy (fig. S1J). 
Among receptor-expressing cells, those co- 


expressing hIL-7R-AECD and hyc-AECD ex- 
hibited the strongest signaling, reaching ~40% 
of ligand-induced STAT5 phosphorylation 
by the WT receptors (Fig. 1, F and G). This result 


Fig. 1. Removal of the 
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or WT receptors (WT IL-7R and = 
WT yc) (bottom) after treat- 
ment with 0 to 200 ug/ml of 
TEV protease or 50 ng/ml 

of hIL-7. (D) Quantification of 
protease-induced pSTAT5 
signals in (C) as pSTAT5/ 
STATS intensity ratios, which 
are further normalized relative 
to that induced by hIL-7. 

The data are shown as 

means + SEMs calculated from 
all technical replicates from 
three independent experiments 
(with one to two technical 
replicates per experiment). 
Statistical significance by 
Student's t test. ns, not 
significant; **P < 0.01; 

***P < 0.001. (E) Schematic 
illustration of all possible 
combinations of full-length 
receptors (IL-7R, yc) and 
ECD-deleted receptors 
(IL-7R-AECD, yc-AECD) 
expressed in BaF3 cells for 
testing ligand-independent signaling. (F) IL-7R pathway activation reported by phosphorylation of STATS, JAK1, and JAK3 in BaF3 cells expressing full-length and 
ECD-deleted receptor combinations illustrated in (E). (G) Quantification of pSTAT5 signals in (F) as pSTAT5/STATS intensity ratios, which are further normalized 
relative to that induced by hIL-7. The data are shown as means + SEMs calculated from all technical replicates from three independent experiments (with one to two 
technical replicates per experiment). Statistical significance by Student's t test. ns, not significant; *P < 0.05; **P < 0.01. 
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was consistent with the activation achieved by 
proteolytic removal of ectodomains (Fig. 1, C 
and D). The cells expressing hIL-7R-AECD or 
hyc-AECD (with a WT receptor) also showed 
weaker but measurable ligand-independent 
signaling (Fig. 1, F and G), which suggests that 
both hIL-7R and hyc ectodomains are involved 
in receptor autoinhibition in the absence of 
ligand. They would prevent the TMD-ICD re- 
gions of hIL-7R and hyc from forming signaling- 
competent complexes. Thus, TMDs appear 
to play an active role in the heterotypic asso- 
ciation between hIL-7R and hyc that drives 
signaling. 
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Fig. 2. Specific heterodimerization of ycTMD with the TMDs of its family 
members. (A) Schematic diagram showing receptor-sharing network for the yc 
(blue) and Bc (yellow) receptor families. Some complexes require a third chain for 
cytokine recognition and signal transduction, such as IL-2Ra and ILI5Ra. IL-4Ra 
and IL-7Ra. can also form heterodimer complexes with IL-13Ra and TSLPR, 
respectively. (B) Schematic illustration of the BACTH system (30) for analyzing 
TMD interactions. ATP, adenosine 5'-triphosphate; cAMP, adenosine 3',5'- 
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TMD interactions match the 

receptor-sharing network 

We next investigated whether the TMD of 
the yc (designated ycTMD) could directly as- 
sociate with that of ILR members (designated 
IL-xRTMD) in the cell membrane and whether 
the heterodimerization of TMDs also exhib- 
ited properties of receptor sharing as do the 
ectodomains. We selected the TMDs of yc, 
IL-4R, IL-7R, and IL-9R from both human 
and mouse to test oligomerization (Fig. 2A). 
We also selected the TMD of IL-5R from the 
6 chain (Bc) family to examine possible non- 
specific association with ycTMD. The TMD 


sequences of these receptors (fig. $3) did not 
show any conserved small-xxx-small motif 
that mediates TMH dimerization in growth 
factor receptors (16, 24-26). Nor did they 
show charged residues that mediate intra- 
membrane assembly of activating immuno- 
receptors (27-29). Thus, pairing of the ye TMD 
with different ILR TMDs was mediated pos- 
sibly by a previously unrecognized type of 
interaction. 

TMD interactions were examined using a 
bacterial adenylate cyclase two-hybrid (BACTH) 
assay (30). Briefly, two complementary domains 


(T18 and T25) of adenylate cyclase (AC) were 
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monophosphate; lacZ, B-galactosidase; X-Gal, X-galactosidase. (©) BACTH analysis 
of TMD interactions for representative yc family receptors and the Bc family 
member IL-5R from both mice and humans. Blue colonies indicate TMD-TMD 
association in the bacteria inner membrane. (D) Quantification of BACTH colony 
colors, which are normalized relative to the IbaG/IbaG positive control. The data are 
shown as means + SEMs calculated from technical replicates (typically three to 
four) from one representative of three independent experiments. 


3 of 8 


RESEARCH | RESEARCH ARTICLE 


fused, separately, to the TMDs under inves- 
tigation (Fig. 2B). The TMD fusion proteins 
were expressed in an AC-deficient (CyaA) strain 
that yielded white bacteria colonies by default. 
Stable association of two TMDs reconstitutes 
AC activity, resulting in blue colonies. For 
both mouse and human sequences, yeTMD 
can stably heterodimerize with the TMDs of 
IL-4R, IL-7R, and IL-9R—the corresponding 
colonies showed the strongest blue colors 
(Fig. 2C and fig. S4). By contrast, no asso- 
ciation between the TMDs of yc and IL-5R 
was detected (Fig. 2C and fig. S4), which in- 
dicates no cross-family TMD association. Ad- 
ditional BACTH results for human yc, IL-2Ro, 
IL-2Rf, IL-15R, and IL-21R from the yc 
family and for human fc and IL-5R from the 


Be family (fig. S5) demonstrated that hetero- 


Fig. 3. NMR structures of ycTMD 
in complex with IL-7RTMD and 
with IL-9SRTMD show a common 
knob-into-hole mechanism of 
recognition. (A) (Left) Detecting 
residue-specific interchain NOEs 
using the sample of (SN, 7H) 
ycIMD mixed with ($C) IL-7RTMD 
or IL-9RTMD at a 1:1 molar ratio. 
The experiment involves recording 
two interleaved 'H-°N TROSY- 
HSQC spectra: one at equilibrium 
and the other with the “c-attached 
liphatic protons inverted during 
200 ms of NOE mixing. (Right) 
Overlaying the difference between 
two interleaved *H-°N TROSY- 
HSQC spectra (red) onto the 
reference spectrum (blue) reveals 
ycIMD residues in close contact 
with IL-7/9RTMD. The spectra 
were recorded at 303 K and 'H 
frequency of 900 MHz. ppm, parts 
per million. (B) Ribbon representa- 
tion of the structures of ycTMD 

in complex with IL-7RTMD (left) and 
with IL-9RTMD (right) in DMPC- 
DHePC bicelles with q = 0.4. 
(C) A close-up view of ycTMD 1274 
(sphere) fitting into the hydrophobic 
pocket of IL-7RTMD (yellow) formed 
by the ®1-xx-@2@3-xx-@ motif. 

(D) The same close-up view as in 
(C) of ycTMD 1274 fitting into the 
pocket of IL-9RTMD (pink). Single- 
letter abbreviations for the amino 
acid residues are as follows: A, 

Ala; C, Cys; D, Asp; E, Glu; F, Phe; 
G, Gly; H, His; |, lle; K, Lys; L, 

Leu; M, Met; N, Asn; P, Pro; Q, 
Gln; R, Arg; S, Ser; T, Thr; V, Val; 
W, Trp; and Y, Tyr. 
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typic TMD associations are consistent with the 
ye family receptor-sharing network (Fig. 2D). 

A homotypic TMD interaction was also ob- 
served for IL-2Roa and IL-2R8, IL-4R, IL-7R, IL- 
OR, and IL-21R, though on average it was 
weaker than the heterotypic interaction in- 
volving ycTMD (Fig. 2C and fig. $5). Although 
the function of homotypic TMD interactions is 
unknown, it may contribute to preassembly 
of ILRs before ligand engagement. A previous 
study had suggested that the IL-2Ra and IL- 
2RB8 are already colocalized in resting T cells in 
the absence of IL-2 (37). 


Structures of ycTMD in complex with 
IL-7RTMD and with IL-9RTMD in bicelles 


The specific recognition of the TMDs of yc 
family ILRs by yecTMD was unexpected be- 


yc TMD / IL-7R TMD 


cause their sequences are highly divergent (fig. 
83). To understand the structural basis of re- 
ceptor sharing at the membrane level, we de- 
termined high-resolution nuclear magnetic 
resonance (NMR) structures of two represen- 
tative heterodimer complexes in bicelles 
mimicking a lipid bilayer: yeTMD bound to 
IL-7RTMD (figs. S6 and S7) and yeTMD bound 
to IL-9RTMD (figs. S6 and S8). In addition, 
an intermolecular nuclear Overhauser effect 
(NOE) difference experiment revealed ycTMD 
residues in the interaction interface. 1274, 
L277, and 1278 in particular experienced inter- 
molecular NOE both in complex with IL-7RTMD 
and with IL-9RTMD (Fig. 3A). 

The two transmembrane heterodimer struc- 
tures were unexpectedly homologous. Foremost, 
this structural similarity was characterized by 
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an ~20° helical packing angle and the involve- 
ment of the same face of yeTMD in packing 
against IL-7RTMD or IL-9RTMD (IL-7/9RTMD) 
(Fig. 3B). Closer inspection of the two struc- 
tures identified 1274 of ycTMD as the key 
residue that filled a hydrophobic hole of IL- 
7/9RTMD comprising four residues (Fig. 3, C 
and D), reminiscent of a knob-into-hole mech- 
anism. These four residues had a sequence 
arrangement of ®,-xx-D,®3-xx-®,, where ®; 
represents hydrophobic residues such as iso- 
leucine, leucine, valine, and phenylalanine that 
constitute the pocket. Other ycTMD residues, 
1266, P267, T270, and L277 also made close 
van der Waals (VDW) contacts with receptor 
TMDs but did not appear to be locked into a 
hole (Fig. 4, A and B). For the receptor TMD, 
$252, L255, L259, and V262 of IL-7RTMD and 


A yc TMD IL-7R TMD 


1279, L282, L283, and F286 of IL-9RTMD were 
in close VDW contact with ycTMD, but none 
of them were wrapped by a defined pocket 
(Fig. 4, A and B). 

Mutating P267, T270, 1274, or L277 of yeTMD 
to tyrosine disrupted heterodimerization with 
IL-4RTMD, IL-7RTMD, and IL-9RTMD (Fig. 
4C and fig. S9, A to H). A similar pattern was 
observed for IL-21RTMD (fig. S91). Mutating 
$252 or L255 of IL-7RTMD to tyrosine also abol- 
ished heterodimerization (Fig. 4D and fig. S9J). 
Thus, these NMR structures are consistent with 
those expressed in the cell membrane and re- 
veal that intramembrane recognition by the yc 
of different family members is based largely on 
the same knob-into-hole mechanism (Fig. 4E). 

This relatively small binding area—501.4 A? 
between ycIMD and IL-7RTMD or 479.0 A” be- 


ye TMD IL-7R TMD Cc 


tween ycIMD and IL-9RTMD—was consistent 
with the small number of residues involved in 
the knob-into-hole interaction. Confinement 
of interaction to a small area could explain 
why IL-7RTMD and IL-9RTMD could both be 
recognized by ycTMD despite only having 35% 
identity. This interaction is reminiscent of the 
small interaction interface between the yc 
ectodomain and cytokines (7), which suggests 
that degenerate recognition by the yc is pos- 
sible because of a small but structurally con- 
sistent recognition surface pattern. 


Essential role of the knob-into-hole interaction 
in the membrane for IL-7R signaling 


To determine whether the specific knob-into- 
hole mechanism of TMD heterodimerization 
is relevant to receptor signaling, we performed 
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Fig. 4. Validation of the surface complementarity and the common mecha- 
nism of ycTMD sharing by mutagenesis. (A) The contours of the ycTMD-IL- 
7RTMD heterodimer interface shown with the IL-7RTMD strand surface rendered 
(left) and the ycTMD surface rendered (right). The ycTMD and IL-7RTMD residues in 
sphere are residues tested by mutagenesis in (C), with red and white indicating 
dimer disruption and no effect, respectively. (B) Same as in (A) for the ycTMD-IL- 
ORTMD heterodimer. (C) BACTH analysis of the effect of ycTMD mutations on 
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ycI MD association with IL-7RTMD, IL-9RTMD, or IL-4RTMD. Corresponding mutations 
for human sequences are shown in fig. S9. Results are from one representative of 
three independent experiments with three replicates per experiment. (D) BACTH 
analysis of the effect of IL-7RTMD mutations on association with ycTMD. Results are 
from one representative of three independent experiments with three replicates per 
experiment. (E) Schematic illustration of the knob-into-hole mechanism of recognition 
that mediates receptor sharing within the membrane. 
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Fig. 5. Specific TMD heterodimerization is required for ligand-induced replicates per experiment. The images represent one of two biological 
IL-7R signaling. (A) Cylinder representation of the ycTMD-IL-7RTMD replicates with similar results. Scale bars, 5 wm. (E) Phosphorylation of 
heterodimer structure showing the positions of the TMD residues of yc and STAT5 (pSTAT5) in BaF3 cells coexpressing WT or mutant hIL-7R with 
IL-7R (sphere) that were tested for ligand-induced receptor signaling. WT or mutant hyc after treatment with 50 ng/ml of hIL-7. The pSTAT5 
Specifically, 1274 of ycTMD and L255 of IL-7R are important for the knob- signals were detected by immunoblotting (top) and compared with 
into-hole mechanism, whereas G272 of ycTMD and V253 of IL-7R are not immunoblot signals of STAT5 (middle). (F) Quantification of pSTAT5 
involved in heterodimerization. (B to D) Coexpression and distribution signals in (E) as pSTAT5/STATS intensity ratios, normalized relative to 
of WT hIL-7R (green) and hyc (magenta), of WT hIL-7R (green) and hyc that of the WT receptors. The data are shown as means + SEMs calculated 
mutants (magenta), and of WT hyc (magenta) and hIL-7R mutants (green) from all technical replicates from three independent experiments 

on the surface of BaF3 without and with treatment with hIL-7. The nuclei (with one to two technical replicates per experiment). Statistical 

of 1 x 10° cells were stained with 4',6-diamidino-2-phenylindole (DAPI) significance by Student's t test. ns, not significant; ****P < 0.0001. 
(blue). Three independent experiments were performed with two PBS, phosphate-buffered saline. 
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structure-guided mutagenesis and tested these 
mutants using functional assays similar to 
those used for receptor activation by TEV pro- 
tease (Fig. 1, C and D). On the basis of the 
structure of ycTMD in complex with IL-7RTMD, 
the knob residue of ycTMD was 1274 in mice 
and 1273 in humans, and a key constituent of 
the hole in IL-7RTMD is L255 (Fig. 5A). These 
residues were mutated to tyrosine to disrupt 
the knob-into-hole interaction. In addition, the 
glycine on the opposite side of the ycTMD 
(G271 in humans) and V253 on the opposite 
side of the IL-7RTMD were mutated to tyro- 
sine as no-effect mutations. 

Full-length hye and hIL-7R as well as their 
single mutants were expressed in BaF3 cells. 
The surface expressions of the mutants were 
all within ~28% of that of the WT (fig. S10) and 
exhibited similar levels of hIL-7 binding (fig. 
S11). Sparse puncta were observed before lig- 
and addition for all variants, which suggests 
that the introduction of the TMD mutations 
did not affect preligand association (Fig. 5, B 
to D). Many more puncta appeared after ligand 
addition for cells coexpressing WT hIL-7R and 
G271Y hyc and for cells coexpressing V253Y 
hIL-7R and WT hyc (Fig. 5, C and D), consist- 
ent with IL-7-induced receptor clustering. By 
contrast, cells expressing either I1273Y hyc or 
L255Y hIL-7R showed no detectable increase 
of puncta after ligand addition, which indicates 
that preventing TMD heterodimerization re- 
duced ligand-induced receptor clustering. 

The imaging results were independently 
confirmed by immunoblot analysis of STAT5 
phosphorylation. Disrupting either the knob 
with the I1273Y mutation or the hole with the 
L255Y mutation completely abolished signali- 
ng, whereas G271Y in hyc or V253Y in hIL-7R 
showed no detectable differences in signaling 
compared with the WT receptors (Fig. 5, E 
and F). Similarly, a mutation of IL-9R that 
disrupts TMD heterodimerization (L282Y) also 
abolished signaling, whereas a mutation away 
from the heterodimer interface (T284Y) showed 
no effect on signaling (fig. S12). Thus, the het- 
erotypic association of receptor TMDs me- 
diated by the knob-into-hole mechanism is 
essential for ligand-induced signaling of IL- 
7R and IL-9R. Moreover, this mechanism is 
likely applicable to other members of the yc 
family. 

Although the V253Y mutation of IL-7RTMD 
did not alter signaling, consistent with the 
structure, the gain-of-function mutation V253G 
(18) could mediate a homotypic IL-7RTMD in- 
teraction that enhanced receptor clustering 
and signaling, as glycine is often implicated 
in TMH dimerization. In addition to knob- 
into-hole interaction mutations, we tested hyc 
TMD mutations found in patients with severe 
combined immunodeficiency diseases (SCIDs), 
including I265N, G268R, M270R, G271E, C278W, 
and V279M (32-34) (fig. S13). These results 
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are consistent with the finding that TMD het- 
erodimerization is required for signaling. 
The mutations C278W and V279M did not 
significantly reduce TMD heterodimerization 
between yc and IL-7R- or IL-7-induced signal- 
ing (fig. S13, F and H). However, these mutations 
may affect yc intramembrane interaction in 
the more complicated trimeric complex com- 
prising IL-2Ra or IL-15Ra, IL-2RB, and ye TMDs 
(Fig. 2A). 


Discussion 


The yc TMD is capable of specific recognition 
of multiple ILR TMDs that have low sequence 
homology (fig. S3). A notable structural ques- 
tion is how specificity and promiscuity are 
concomitantly achieved. The complementary 
surface representations of IL-7/9RTMD and 
ycTMD (Fig. 3) show that IL-7RTMD and IL- 
QRTMD both have a deep pocket fitting the 
1274 knob of ycTMD. In this context, the 1274 
knob of ycTMD can be perceived as the key 
that fits the small hydrophobic hole (or lock) 
presented by the different ILR TMDs, which 
would explain the promiscuity. The structural 
features governing specificity, however, may 
go beyond the knob-into-hole module. The 
vertical positions in the lipid bilayer of the 
two TMDs are important for aligning the 1274: 
knob with the hydrophobic hole for binding. 
Other supporting interactions may further en- 
hance specificity, such as the interaction of 
L277 and Y281 of ycTMD with the comple- 
mentary surface of the receptor TMD near its 
C-terminal end (Fig. 4, A and B). The three 
ycTMD residues with 100% conservation across 
species (1274, L277, and Y281) (fig. S3) are pre- 
cisely those residues making close VDW con- 
tacts with the receptor TMDs in our structures. 
Specific TMD pairing required for signaling 
of the yc family receptors may inform the 
generation of TMDs that can modulate recep- 
tor activity. Although the design of transmem- 
brane proteins remains challenging, several 
studies have demonstrated its feasibility for 
TMD oligomerization (35-37). The use of a 
designed transmembrane peptide to modu- 
late immunoreceptor activity by competing 
with the TMD heterodimerization between 
the integrin a and B subunits could push the 
equilibrium from an inactive to an active 
state (38). More recently, designed TMD oligo- 
merization has been applied to mediate chi- 
meric antigen receptor (CAR) oligomerization to 
enhance the therapeutic window of CAR T 
immunotherapies (39). The distinct features 
of the knob-into-hole interaction that medi- 
ate yc sharing and the structural differences 
among the ILR members of the family could 
be exploited to fashion decoy ycTMDs for the 
selective interference of cytokine signaling. 
The partial signaling after removal of recep- 
tor ectodomains strongly suggests that they 
are arranged in an autoinhibitory state in the 


absence of ligand. This autoinhibition likely 
involves homotypic and heterotypic associa- 
tions of the ectodomains of ILRs and yc, which 
is consistent with our observation that the co- 
expression of IL-7R and yc resulted in their 
colocalization on the cell surface (fig. S1, A 
and I). The preligand association of ILRs and 
ye may also organize the receptors for a biased 
response to ligand activation. Previous studies 
have shown that preassociation of the ye with 
the IL-7R reduces the amount of free yc for 
other members of the family, which results in 
asymmetrical cross-talk toward stimulation 
with different cytokines (22, 23). To date, no 
structural information has been made availa- 
ble for heterotypic preligand association in 
the yc receptor family. A crystal structure of 
homotypic association, however, has been re- 
ported for the IL-7R ectodomain (27), which 
shows a head-to-head antiparallel configura- 
tion proposed to be autoinhibitory, as it keeps 
the TMD-ICDs apart (19). 

It is unclear whether the TMD homotypic 
interaction competes or synergizes with the 
heterotypic TMD interaction in receptor activa- 
tion. Our intermolecular NOE data of IL-7RTMD 
in bicelles suggest that TMD self-association is 
driven by the packing of hydrophobic residues 
(F251, L255, and 1258) from the opposite 
TMDs (fig. S14, A and B). The L255Y mutation 
prevented hIL-7RTMD self-association in the 
BACTH assay (fig. S14), but L255 is an integ- 
ral part of the hydrophobic pocket in the 
knob-into-hole mechanism (Fig. 3C). Thus, at 
least in the case of IL-7RTMD, the homotypic 
association is expected to compete with the 
heterotypic interaction with the yc TMD. The 
functional relevance of this result remains to 
be studied. 

We have demonstrated that TMDs of the yc 
family of cytokine receptors can conserve spec- 
ificity without sacrificing promiscuity using a 
knob-into-hole recognition pattern that medi- 
ates receptor sharing in the membrane and is 
important for receptor activation. Our finding 
provides a conceptual framework for con- 
structing TMHs to modulate yc family recep- 
tor activities to potentially address the gain- or 
loss-of-function disease mutations associated 
with these TMDs. 
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TECHNICAL COMMENT 


Comment on “Food wanting is mediated by transient 
activation of dopaminergic signaling in the honey 


bee brain” 


Andrew Barron’, Susan E. Fahrbach?, Alison R. Mercer, Karen A. Mesce*, David J. Schulz®, 


Brian H. Smith®, Eirik Sovik’ 


Huang et al. (1) make an exciting claim about a human-like dopamine-regulated neuromodulatory 
mechanism underlying food-seeking behavior in honey bees. Their claim is based on experiments 
designed to measure brain biogenic amine levels and manipulate receptor activity. We have concerns 
that need to be addressed before broad acceptance of their results and the interpretation provided. 


iogenic amines drive the motivation to 

find food—“wanting” behavior—in mam- 

mals (2). It is important to establish 

whether these mechanisms are phylo- 

genetically conserved in insects or wheth- 
er they have had different origins and endpoints. 
Huang et al. (1) use HPLC to measure dopa- 
mine levels in individual brains of foragers at 
different phases of food seeking. Then, using 
manipulations of dopamine receptors, they 
claim that blockade or activation of dopamine 
receptors affects food wanting. Given the im- 
plication of this claim, that there is a phyloge- 
netically ancient mechanism for meditating 
food wanting, we feel compelled to engage in 
a constructive conversation about this impor- 
tant work. Our primary concerns center around 
two questions. 


How can the use of HPLC on whole 

brain extracts resolve the reported changes 
in biogenic amine levels across such 

short time scales? 


Biogenic amines can be measured in several 
ways. Changes in biogenic amine levels in 
small areas of the brain (3) can be detected 
using microdialysis or in vivo voltammetry, 
both of which measure changes in extracellu- 
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lar levels that reflect the acute release of 
amines and the efficiency of their reuptake. In 
what might be characterized as a relatively 
fast, zero-sum game, biogenic amines cycle 
from being vesicle-bound inside the cell, to 
released and acutely active, and back (4). 

In contrast, HPLC of biogenic amine levels 
extracted from whole brains does not specif- 
ically measure active, acutely released biogenic 
amines. Rather, whole brain sampling pro- 
vides the total of the intracellular reserves and 
extracellular pools of biogenic amines: a total 
that at any point in time is assumed to reflect 
primarily biogenic amines stored in vesicles. 
Therefore, changes in biogenic amine levels 
measured in this way reflect a slower, changed- 
sum process. To date, global changes in biogenic 
amine levels in bees have reflected differences 
in behavioral state that take place over, and 
persist for, days. This is a time frame over 
which changes in synthesis and degradation 
could leave their mark in brain-wide, HPLC- 
based measurements of biogenic amines. 

As changes in acute release and reuptake are 
not expected to appear as dramatic changes in 
total brain biogenic amine levels, this brings 
into question the results in figure 1 (7). The 
authors report changes in brain-wide dopa- 
mine levels on time scales that do not seem 
physiological. For example, the authors report 
dopamine levels going from 200% or 300% of 
baseline and back within one or a few min- 
utes, which reflects physiological events not 
yet documented in any animal. We also find it 
unlikely that the reported rapid changes in 
brain dopamine reflect release into the hemo- 
lymph (blood). For all insects examined to 
date, dopaminergic neurons do not project to 
the periphery, unlike serotonergic and octo- 
paminergic neurons (5). Dopamine is pre- 
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dominantly confined to brain compartmg Check for 
There is no quick way to release dopant 


into the hemolymph after its action on the 
brain. 

Two additional concerns are important in this 
context. First, Huang et al. utilized between- 
animal sampling at different time points— 
prior to or after dancing or feeding, for exam- 
ple. This method of sampling contrasts with 
the within-animal method typically used to 
study mammalian brains, wherein biogenic 
amine levels before and after an event are 
sampled in the same individual. The assump- 
tion here is that an animal that is sampled 
before an event accurately reflects the initial 
state of a different animal sampled after 
an event. If that assumption does not hold, 
between-animal sampling can give rise to er- 
rors. There is not enough information in the 
methods to indicate what precautions were 
taken. Second, all other studies using HPLC to 
measure biogenic amine levels in honey bee 
brains have quantified octopamine (6, 7). As 
octopamine has been consistently implicated 
as a mediator of reward in honey bees (8, 9), a 
potential role for octopamine in driving food 
motivation must be considered. Huang e¢ al. 
report that octopamine was not consistently 
observed in their samples. This omission raises 
the concern that dopamine and octopamine 
may have been co-detected as a single peak, 
particularly because of the pH of 4 reported 
for the mobile phase, which is lower than the 
PH used in separation of dopamine and octo- 
pamine in other studies (10). The authors also 
report a temperature of their mobile phase of 
40°C, which is potentially damaging to heat 
labile amines. 


Conclusions 


The claims made by Huang e¢ al. about a 
dopamine-based drive for food wanting, and 
its phylogenetic homology with mammals, are 
exciting. However, for reasons described here, 
we feel that such claims are not supported by 
the data. 
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Response to comment on “Food Wanting is Mediated 
by Transient Activation of Dopaminergic Signaling 


in the Honeybee Brain” 


Songkun Su’ and Martin Giurfa?>* 


In a technical comment, Barron et al. (1) criticized the work of Huang et al. (2) putting the accent on 
the quantification of dopamine levels via high-performance liquid chromatography (HPLC), yet also 
including data interpretation through alternative hypotheses aimed at invalidating the original ones 
proposed by Huang et al. We thank the authors of this technical comment, which allows us to clarify 
technical aspects of our work that may have been unclear, and for promoting discussion around the 
conclusions of our work. Below we provide answers to the points raised in their comment. 


Use of HPLC on whole brain extracts to detect 
biogenic amine levels 

We appreciate the comments from Barron e¢ al. 
(2) on the dynamics of dopamine release and 
reuptake as detectable by HPLC measurements. 
We understand the concerns expressed about 
the speed of changes related to dopamine lev- 
els. However, similar variations of dopamine 
levels have been observed in analyses performed 
in Barron’s laboratory (3). In this work (3), 
large changes in brain levels of dopamine were 
reported for honey bee guards within 3 min 
of treatment using HPLC methods, similar to 
Huang et al.’s work (2). These changes were 
significant, occurred within short delays, and 
were associated with behavioral relevant events 
of different nature (e.g., exposure to disturbance 
and alarm pheromone components). Concerns 
about the dynamics of changes in brain do- 
pamine levels should have been then raised in 
this case too but we are not aware that such 
concerns existed in that case. The fact that only 
bee guards were used (i.e., a single behavioral 
state) in this study shows, in addition, that it 
is incorrect to affirm that global changes in 
biogenic amine levels in bees have only been 
related to differences in behavioral state. More- 
over, a recent work by scientists with whom 
we have no relationship has showed a similar 
rapid increase of dopamine levels in the brain 
of honey bee dancers compared to non-active 
foragers, and fast decreases in brain dopamine 
levels in the case of active foragers that ex- 
perienced aversive encounters with predators 
or received danger signals within the hive (4). 
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This work used similar HPLC quantification 
methods and showed in addition that enhanc- 
ing artificially dopamine levels via oral delivery 
in a time-scale similar to that of our treatments 
reverts the negative effects of aversive experi- 
ences on foraging activities (4). Although this 
new evidence is consistent with the findings 
reported in our work (2), we agree that it is 
important to determine in future studies how 
individual brain levels of dopamine could be 
rapidly lowered and enhanced following aver- 
sive and appetitive experiences, respectively, 
and thank Barron et al. for a stimulating dis- 
cussion on this point. 

Barron et al. mention two additional con- 
cerns, namely the use of between-animal sam- 
pling at different time points, and the lack of 
quantification of octopamine in our studies. 
Concerning the first point, the fact that within- 
animal methods are typically used to study 
mammalian brains does not mean that the 
same methodology is possible in the case of 
insect brains. Barron e¢ al. are well placed to 
know that there is no way to capture an insect, 
sample biogenic amines from its brain, which 
implies destroying the head capsule and the 
brain tissue of its miniature brain, and then 
release it with the double unrealistic expec- 
tation that 1) the animal would prosecute its 
normal behavior (foraging, dancing, etc.) and 
that 2) it would be possible to capture it again 
to re-sample the biogenic amine studied. In 
this implausible scenario, the first sampling 
would have immediate lethal consequences. 
Thus, the fact that this methodology is used 
in vertebrates is not a justification to claim 
that it can be used in behaving insects. The 
method we used—extracting brains to perform 
single-brain analyses at each phase of interest— 
is the sampling method that is available for 
insects and that has been used in many works, 
including those of Barron et al. (e.g., 3, 5, 6, 7), 
which should then be addressed by the same 


we will certainly apply the methodology once 
made available to the scientific community. 

Concerning the second point, the lack of de- 
tection of octopamine was certainly frustrating 
as we aimed at quantifying it, as clearly men- 
tioned in our text. The detection instrument we 
used was a High-Performance Liquid Chro- 
matography (HPLC; UltiMate 3000, Thermo- 
Fisher) with Electrochemical Detection (ECD; 
COULOCHEM III, ThermoFisher) equipped with 
single electrode analytical cell. Generally, dopa- 
mine and serotonin can be detected at 350 mV, 
while octopamine and tyramine produce large 
peaks at 650 mV (8). Thus, we could not simul- 
taneously detect dopamine, 5-HT and octo- 
pamine. Further experimentation would be 
required to assess octopamine under similar 
experimental conditions. In other studies, a pH 
of 3.6/3.7 was used for the detection of biogenic 
amines (e.g., 9, 10) and it was certainly suitable 
for the separation of dopamine and serotonin. 
In addition, we used an Acclaim C18 reversed- 
phase column (2.2 um, 2.1 x 150 mm) in mobile 
phase with the temperature of the column oven 
at 40°C following similar settings as those 
used to reliablyquantify serotonin levels in the 
digestive system of bees (9). A comparable tem- 
perature (39°C) was used in a similar setup 
to detect brain dopamine levels in honey bee 
foragers (4). Thus, stating that these temper- 
atures are unsuitable for quantifying biogenic 
amines is inappropriate. Overall, the parameter 
setting of our analyses complied with existing 
works performed on honey bees, which makes 
us confident about the detection of dopamine 
and serotonin reported in our work. 


Discussion of alternative explanations 


Barron et al. (1) propose an alternative expla- 
nation for our findings, namely that topical 
exposure to dopamine would increase internal 
levels of adipokinetic hormone (AKH) released 
by the corpora cardiaca outside of the brain, as 
in fruit flies (the reference they provided: (77). 
This argument is surprising as it ignores basic 
facts about the biology of AKH in bees, which 
differs drastically from that of fruit flies. As 
mentioned repeatedly in our text, we used only 
forager bees that were active throughout our 
experiments. In these bees, which lack fat body 
storage for carbohydrates (12) and have low 
glycogen reserves (13), several lines of evidence 
indicate that AKH has lost its original func- 
tion due to the social life style (74) as foragers 
rely on sugar available in the collective stores of 
the hive rather than on individual fat body re- 
sources (73). The comparison with flies is thus 
inappropriate as solitary insects depend on fat 
body energy stores (lipids and glycogen), which 
is not the case for honey bee foragers. 

In addition, the hypothesis of Barron e¢ ai. 
(1) is refuted by explicit tests of the effects of 
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AKH on appetitive behavior of bees, includ- 
ing food ingestion and sucrose responsiveness 
(15, 16), which they claim should be increased 
upon enhancement of AKH levels. Consistent 
with the lack of functionality of AKH in bee 
foragers, recent results showed that AKH lev- 
els do not vary with starvation (15) and that 
AKH does affect neither ingestion of sucrose 
solution nor sucrose responsiveness in forager 
bees (6), contrary to Barron et al.’s expecta- 
tions. In addition, no effect of AKH on the 
proclivity to respond appetitively and ingest 
an unpalatable mixture of sucrose and bitter 
salicin was detected (16)—different from what 
is observed in flies (77)—showing that the hy- 
pothesis of a possible indirect effect via AKH is 
not tenable and that ignoring basic differences 
in the physiology of solitary and social insects 
may be misleading. 


Conclusion 


We are thankful to the technical comment by 
Barron et al. (1) insofar as it allows us to clarify 
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technical aspects and statistical procedures of 
our original study (2). While constructive de- 
bate is always welcome when novel results are 
made available, requests for unfeasible experi- 
ments (i.e., within-brain sampling of biogenic 
amines in different behavioral contexts), crit- 
icisms of HPLC methods that have been proven 
useful in other works and the proposal of alter- 
native hypothesis that do not consider basic 
facts of honey bee biology do not necessarily 
advance us in the understanding of the neural 
bases of honey bee motivation. We thus prefer 
to focus on the valid comment stating the need 
of clarifying how brain levels of dopamine can 
be rapidly lowered and enhanced in the bee 
brain, a result that has been confirmed by re- 
cent work (4) and that should promote future 
investigations. 
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How art improves my science 


ou like pottery more than you like science,” a family member said at my 25th birthday party. I 
knew the comment was made lightly, but I felt frustrated and disappointed. I had worked so 


hard to achieve a balance between my scientific and artistic pursuits. After experiencing doubts 
about whether I could, in fact, do both, I was convinced that my creative outlets complemented 
my science rather than detracting from it. So why was my creative side still seen as a weakness 
and a distraction, in opposition to my science? 


As a kid, I created art by smashing 
up eggshells, dying them, and gluing 
them to a canvas. I also performed 
science experiments with my mom, 
a high school science teacher, on 
the back porch. In college, I loved 
my chemistry courses and research 
work, and I embraced cooking as my 
creative outlet, hosting weekly din- 
ner parties for friends. 

Yet I heard time and again that 
creative expression and _ science 
were incompatible. When my col- 
lege roommate concluded with a 
personality test that I was the “cre- 
ative” type, the rest of my friends 
said that couldn’t be right, be- 
cause “I was a scientist and there- 
fore was not creative.” When I tell 
people I’m in a chemistry Ph.D. 
program, they often respond with 
something along the lines of, “I could 
never do that because I am more of the creative type.” 

At times I have internalized this misconception. As I was 
preparing for graduate school and studying for the chemistry 
GRE test, I realized I talked about science the same way I 
would paint a picture, explaining concepts in broad strokes— 
in part because I struggled to remember technical scientific 
terminology. I worried my way of thinking wouldn’t fit in 
with “proper” scientists. In the end, though, my desire to pur- 
sue a scientific career won out. I figured I would continue to 
find a way to balance my research with my creative pursuits, 
regardless of what others thought. 

I started during my first year of grad school, when I served 
as a teacher’s assistant for an analytical chemistry lab course. 
Todd, the professor leading the class, spotted me watching 
him fix glassware on the torch and offered to give me a lesson, 
and I became obsessed. I learned how to repair equipment as 
well as artistic glass blowing techniques. Todd bought me a 
plethora of colored glass, which I transformed into pendants, 
ornaments, and animal statues. I felt like myself, reveling in 
my scientific studies while also satisfying my artistic side. 


“needed a creative outlet to 
keep myself going; it just 
had to fit within my schedule.” 


But once I finished my teaching 
assistant duties and no longer had 
an excuse to go to the analytical lab 
on a regular basis, finding time for 
glass blowing became difficult and 
my research began to suffer. Maybe 
science and art really were incom- 
patible. Or maybe I simply didn’t 
have time for both. Either way, I 
felt that to succeed as a scientist I 
needed to make a choice. So, enter- 
ing my third year of graduate school, 
after blowing glass for about a year 
and a half, I gave it up. 

But I still found myself struggling 
with my research, just in a differ- 
ent way. I was starting to feel more 
burned out in the lab. I had a hard 
time brainstorming innovative solu- 
tions. I had more time than before 
to dedicate to my research, but I was 
missing something that was perhaps 
even more crucial. I needed a creative outlet to keep myself 
going; it just had to fit within my schedule. 

This was when I found pottery, which I could pursue on 
evenings and weekends. After my first class, I was hooked. 

My family member’s comment about my love for pottery 
shook me but did not halt my passion for the craft. In- 
stead, it has inspired me to more intentionally mix my art 
with my science. I keep a notebook at my lab bench filled 
with doodles of mug designs inspired by my experiments, 
such as my work investigating copper’s ability to cause 
protein precipitation. Through the challenge of distilling 
my paper into its core concepts in such a way that it could 
be displayed on a mug, I gained a better understanding of 
my project. 

Pottery has now turned into a way for me to train my brain 
to be creative and think outside the box. It helps me learn how 
to take a lump of clay and turn it into the unimaginable. 


Amy Robison is a Ph.D. student at Duke University. Send your career 
story to SciCareerEditor@aaas.org. 
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CLIMATE CHANGE AND SUSTAINABILITY 


The Bieler School of Environment and the Department of Natural Resource 
Sciences at McGill University invite outstanding applications for a jointly 
appointed endowed Bieler Research Chair in Northern Climate Change and 
Sustainability at the Associate or Full Professor level. 


The successful candidate will be a rising or established leader in natural 
sciences, social sciences, or economics research on climate change with 
a preferred geographic focus on Northern regions and Indigenous issues. 
The Chair will be internationally recognized for advanced contributions to 
research on biodiversity, climate change, ecology, ecosystems, environment, 
and/or sustainability. Candidates must have demonstrated extensive 
evidence of peer-reviewed grant funding and publications, and have strong 
academic, government, and/or private research networks. 


The position will complement existing strengths in wildlife and fisheries 
biology, forest and landscape ecology, entomology, microbiology, soil 
science, ecotoxicology, economics and natural resource policy in the 
Department of Natural Resource Sciences, as well strengths in biodiversity, 
change mapping, transportation, human well-being, and institutional and 
societal change in the Bieler School. The successful candidate will also be 
able to contribute to mentoring and teaching excellence in the Bieler School 
of Environment and in the Department of Natural Resource Sciences. 


For details and to apply: https://tinyurl.com/57r6mwjy 
Questions concerning the position can be directed to Frederic Fabry frederic. 


fabry@mcegill.ca (Director, Bieler School of Environment) or Brian Driscoll 
brian.driscoll@mcegill.ca (Chair, Natural Resource Sciences) 
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